Fr. 106.00

Boris Hanin, Boris (Princeton University Hanin, Daniel A. Roberts, Daniel A. (Massachusetts Institute of Tec Roberts, Daniel A. (Massachusetts Institute of Technology) Roberts, Roberts Daniel A....

Principles of Deep Learning Theory - An Effective Theory Approach to Understanding Neural Networks

Inglese · Copertina rigida

Spedizione di solito entro 1 a 3 settimane (non disponibile a breve termine)

Descrizione

Ulteriori informazioni

"This textbook establishes a theoretical framework for understanding deep learning models of practical relevance. With an approach that borrows from theoretical physics, Roberts and Yaida provide clear and pedagogical explanations of how realistic deep neural networks actually work. To make results from the theoretical forefront accessible, the authors eschew the subject's traditional emphasis on intimidating formality without sacrificing accuracy. Straightforward and approachable, this volume balances detailed first-principle derivations of novel results with insight and intuition for theorists and practitioners alike. This self-contained textbook is ideal for students and researchers interested in artificial intelligence with minimal prerequisites of linear algebra, calculus, and informal probability theory, and it can easily fill a semester-long course on deep learning theory. For the first time, the exciting practical advances in modern artificial intelligence capabilities can be matched with a set of effective principles, providing a timeless blueprint for theoretical research in deep learning"--

Sommario

Preface; 0. Initialization; 1. Pretraining; 2. Neural networks; 3. Effective theory of deep linear networks at initialization; 4. RG flow of preactivations; 5. Effective theory of preactivations at initializations; 6. Bayesian learning; 7. Gradient-based learning; 8. RG flow of the neural tangent kernel; 9. Effective theory of the NTK at initialization; 10. Kernel learning; 11. Representation learning; ¿. The end of training; ¿. Epilogue; A. Information in deep learning; B. Residual learning; References; Index.

Info autore

Daniel A. Roberts was cofounder and CTO of Diffeo, an AI company acquired by Salesforce; a research scientist at Facebook AI Research; and a member of the School of Natural Sciences at the Institute for Advanced Study in Princeton, NJ. He was a Hertz Fellow, earning a PhD from MIT in theoretical physics, and was also a Marshall Scholar at Cambridge and Oxford Universities.Sho Yaida is a research scientist at Meta AI. Prior to joining Meta AI, he obtained his PhD in physics at Stanford University and held postdoctoral positions at MIT and at Duke University. At Meta AI, he uses tools from theoretical physics to understand neural networks, the topic of this book.Boris Hanin is an Assistant Professor at Princeton University in the Operations Research and Financial Engineering Department. Prior to joining Princeton in 2020, Boris was an Assistant Professor at Texas A&M in the Math Department and an NSF postdoc at MIT. He has taught graduate courses on the theory and practice of deep learning at both Texas A&M and Princeton.

Riassunto

This is the first book focused entirely on deep learning theory. Tools from theoretical physics are borrowed and adapted to explain, from first principles, how realistic deep neural networks work, benefiting practitioners looking to build better AI models and theorists looking for a unifying framework for understanding intelligence.

Dettagli sul prodotto

Autori	Boris Hanin, Boris (Princeton University Hanin, Daniel A. Roberts, Daniel A. (Massachusetts Institute of Tec Roberts, Daniel A. (Massachusetts Institute of Technology) Roberts, Roberts Daniel A., Sho Yaida
Editore	Cambridge University Press ELT

Lingue	Inglese
Formato	Copertina rigida
Pubblicazione	30.06.2022

EAN	9781316519332
ISBN	978-1-316-51933-2
Pagine	472
Categorie	Scienze naturali, medicina, informatica, tecnica > Fisica, astronomia > Altro machine learning, Artificial Intelligence, SCIENCE / Physics / General, SCIENCE / Physics / Mathematical & Computational, Artificial Intelligence (AI), Mathematical physics, Statistical physics, COMPUTERS / Artificial Intelligence / General, COMPUTERS / Data Science / Machine Learning, Mathematical / Computational / Theoretical physics

Recensioni dei clienti

Per questo articolo non c'è ancora nessuna recensione. Scrivi la prima recensione e aiuta gli altri utenti a scegliere.

Scrivi una recensione

Top o flop? Scrivi la tua recensione.

Il tuo contatto a CeDe