Data Patterns and Algorithms for Modern Applications
Buch, Englisch, 286 Seiten, Format (B × H): 178 mm x 254 mm, Gewicht: 576 g
ISBN: 978-1-4842-3314-6
Verlag: Apress
Many of these techniques are recent developments, still not in widespread use. Others are standard algorithms given a fresh look. In every case, the focus is on practical applicability, with all code written in such a way that it can easily be included into any program. The Windows-based DATAMINE program lets you experiment with the techniques before incorporating them into your own work.
What You'll Learn
- Use Monte-Carlo permutation tests to provide statistically sound assessments of relationships present in your data
- Discover how combinatorially symmetric cross validation reveals whether your model has true power or has just learned noise by overfitting the data
- Work with feature weighting as regularized energy-based learning to rank variables according to their predictive power when there is too little data for traditional methods
- See how the eigenstructure of a dataset enables clustering of variables into groups that exist only within meaningful subspaces of the data
- Plot regions of the variable space where there is disagreement between marginal and actual densities, or where contribution to mutual information is high
Anyone interested in discovering and exploiting relationships among variables. Although all code examples are written in C++, the algorithms are described in sufficient detail that they can easily be programmed in any language.
Zielgruppe
Professional/practitioner
Autoren/Hrsg.
Fachgebiete
- Mathematik | Informatik EDV | Informatik Daten / Datenbanken Data Mining
- Mathematik | Informatik EDV | Informatik Programmierung | Softwareentwicklung Programmier- und Skriptsprachen
- Mathematik | Informatik EDV | Informatik Programmierung | Softwareentwicklung Objektorientierte Programmierung
- Mathematik | Informatik EDV | Informatik Programmierung | Softwareentwicklung Algorithmen & Datenstrukturen
Weitere Infos & Material
1. Information and Entropy.- 2. Screening for Relationships.- 3. Displaying Relationship Anomalies.- 4. Fun With Eigenvectors.- 5. Using the DATAMINE Program.