The Engineer’s Guide To Deep Learning
We are in the third golden age of AI.
In the previous two golden ages (1950s-1960s and the 1980s), our expectations outpaced the capabilities of the technology at the time, leading to disappointment. In contrast, the AI technology of the current golden age, which began in the mid-2010s, has consistently exceeded our expectations.
Among AI technologies, the Transformer, introduced in 2017, stands as a groundbreaking breakthrough. Initially developed as a machine translation model, its impact has extended to permeate nearly every field. Today, the Transformer model is considered essential knowledge for modern engineers.
The first goal of this document is to provide the shortest path for engineers to understand the Transformer.
What is this document
- A concise guidebook:
This document provides just enough information to learn the Transformer.
What this document provides
-
Working Python code examples for hands-on learning:
To enhance comprehension, this document provides working Python code examples that readers can run themselves. -
References for further exploration:
This document introduces readers to a variety of documentation options, recognizing that different individuals find different resources more accessible.
- Part 1: Neural Networks
Introduces the fundamental concepts of neural networks. - Part 2: Recurrent Neural Networks (RNNs)
Explores RNNs, including LSTM and GRU. - Part 3: Natural Language Processing (NLP) and Attention Mechanisms
Provides the essential principles of NLP, encompassing machine translation and attention mechanisms. - Part 4: Transformer
Unravels the Transformer model. - Appendix: Basic Knowledge
Provides the minimum knowledge of Python and mathematics required to understand the Transformer.
Next goal
Many Transformer-based technologies are currently being developed. There will definitely be another major breakthrough in the near future. I might write about them if I have time.
Copyright
© Copyright ALL Right Reserved, Hironobu SUZUKI.
For any inquiries regarding the use of this document or any of its figures, please contact me after reading the following FAQ:
When you send me an email, please provide at least two SNS addresses (e.g. LinkedIn, Twitter) for verification purposes. Due to the XZ backdoor incident, I no longer accept contact from anonymous individuals.
Exception Educational institutions can use this document freely.
Author
Hironobu SUZUKI
I am a software programmer/engineer, the author of:
I graduated from graduate school in information engineering (M.S. in Information Engineering), had worked for several companies as a software developer and technical manager/director, and published seven books (4 PostgreSQL books and 3 MySQL books) in Japanese and a Chinese book.
As a director of the Japan PostgreSQL Users Group (2010-2016), I organized the largest (non-commercial) technical seminar/lecture on PostgreSQL in Japan for more than six years, and also served as the program committee chair of the Japan PostgreSQL Conference in 2013 and as a member in 2008 and 2009. In June 2022, my interview article was published.
Cuando era joven, vivió en Sudamérica por unos años. Recientemente, a veces vuelve a allí.
I am looking for a new job, applying ML and AI technologies to DBMS.
I’m interested in History, Animal Rights, Cosmology, Social Issues, Environment Issues. I play the piano and guitar. Vegetarian. I love animals, music, science.