Fr. 169.00

Evolutionary Decision Trees in Large-Scale Data Mining

Inglese · Copertina rigida

Spedizione di solito entro 6 a 7 settimane

Descrizione

Ulteriori informazioni


This book presents a unified framework, based on specialized evolutionary algorithms, for the global induction of various types of classification and regression trees from data. The resulting univariate or oblique trees are significantly smaller than those produced by standard top-down methods, an aspect that is critical for the interpretation of mined patterns by domain analysts. The approach presented here is extremely flexible and can easily be adapted to specific data mining applications, e.g. cost-sensitive model trees for financial data or multi-test trees for gene expression data. The global induction can be efficiently applied to large-scale data without the need for extraordinary resources. With a simple GPU-based acceleration, datasets composed of millions of instances can be mined in minutes. In the event that the size of the datasets makes the fastest memory computing impossible, the Spark-based implementation on computer clusters, which offers impressive fault tolerance and scalability potential, can be applied.

Sommario

Evolutionary computation.- Decision trees in data mining.-  Parallel and distributed computation.- Global induction of univariate trees.- Oblique and mixed decision trees.- Cost-sensitive tree induction.- Multi-test decision trees for gene expression data.- Parallel computations for evolutionary induction.

Riassunto

This book presents a unified framework, based on specialized evolutionary algorithms, for the global induction of various types of classification and regression trees from data. The resulting univariate or oblique trees are significantly smaller than those produced by standard top-down methods, an aspect that is critical for the interpretation of mined patterns by domain analysts. The approach presented here is extremely flexible and can easily be adapted to specific data mining applications, e.g. cost-sensitive model trees for financial data or multi-test trees for gene expression data. The global induction can be efficiently applied to large-scale data without the need for extraordinary resources. With a simple GPU-based acceleration, datasets composed of millions of instances can be mined in minutes. In the event that the size of the datasets makes the fastest memory computing impossible, the Spark-based implementation on computer clusters, which offers impressive fault tolerance and scalability potential, can be applied.

Testo aggiuntivo

“The structure of the book is well-thought-out. … I recommend the book for students, researchers, and developers interested in real-life applications of big data analysis.” (K. Balogh, Computing Reviews, February 15, 2021)

Relazione

"The structure of the book is well-thought-out. ... I recommend the book for students, researchers, and developers interested in real-life applications of big data analysis." (K. Balogh, Computing Reviews, February 15, 2021)

Dettagli sul prodotto

Autori Marek Kretowski
Editore Springer, Berlin
 
Lingue Inglese
Formato Copertina rigida
Pubblicazione 01.01.2019
 
EAN 9783030218508
ISBN 978-3-0-3021850-8
Pagine 180
Dimensioni 158 mm x 243 mm x 19 mm
Peso 403 g
Illustrazioni XI, 180 p. 69 illus.
Serie Studies in Big Data
Categorie Scienze naturali, medicina, informatica, tecnica > Tecnica > Tematiche generali, enciclopedie

B, Künstliche Intelligenz, Big Data, Artificial Intelligence, engineering, Technology: general issues, Data Engineering, Computational Intelligence, Databases, Engineering—Data processing

Recensioni dei clienti

Per questo articolo non c'è ancora nessuna recensione. Scrivi la prima recensione e aiuta gli altri utenti a scegliere.

Scrivi una recensione

Top o flop? Scrivi la tua recensione.

Per i messaggi a CeDe.ch si prega di utilizzare il modulo di contatto.

I campi contrassegnati da * sono obbligatori.

Inviando questo modulo si accetta la nostra dichiarazione protezione dati.