Category: AI
-
DL implementation study – Feb 16, 2024
makemore repo with the completed bigram model (lecture 2): https://github.com/gangfang/makemore Notes: Questions:
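For context, here is a minimal sketch of a count-based bigram character model of the kind that lecture covers. This is my own illustration, not code from the repo above, and it assumes a plain-text word list such as names.txt with one name per line.

import torch

words = open('names.txt').read().splitlines()    # assumed word list, one name per line
chars = sorted(set(''.join(words)))
stoi = {s: i + 1 for i, s in enumerate(chars)}
stoi['.'] = 0                                    # '.' marks the start/end of a name
itos = {i: s for s, i in stoi.items()}

# Count how often each character follows each character.
N = torch.zeros((len(stoi), len(stoi)), dtype=torch.int32)
for w in words:
    cs = ['.'] + list(w) + ['.']
    for c1, c2 in zip(cs, cs[1:]):
        N[stoi[c1], stoi[c2]] += 1

# Turn counts into row-wise probabilities (+1 smoothing) and sample one name.
P = (N + 1).float()
P /= P.sum(1, keepdim=True)
g = torch.Generator().manual_seed(2147483647)
ix, out = 0, []
while True:
    ix = torch.multinomial(P[ix], num_samples=1, generator=g).item()
    if ix == 0:
        break
    out.append(itos[ix])
print(''.join(out))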
-
DL implementation study – Jan 5, 2024
building makemore before part 2: # Doing the same thing but with an ANN # imagine what the model looks like before watching the video: # input is a char, output is the next char # one-hot encoding to make each output neuron a binary classifier # softmax to get the prob dist # N is not…
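A rough sketch of the model those comments seem to describe: a single linear layer fed a one-hot encoded character and trained with softmax/cross-entropy to predict the next character. This is my own reconstruction under assumptions (27-token vocabulary, hand-made toy training pairs), not the notebook's actual code.

import torch
import torch.nn.functional as F

vocab_size = 27                               # 26 letters + a '.' boundary token (assumed)
g = torch.Generator().manual_seed(2147483647)
W = torch.randn((vocab_size, vocab_size), generator=g, requires_grad=True)

# Toy (current char -> next char) index pairs, purely illustrative.
xs = torch.tensor([0, 5, 13, 13, 1])
ys = torch.tensor([5, 13, 13, 1, 0])

for _ in range(100):
    xenc = F.one_hot(xs, num_classes=vocab_size).float()   # one-hot inputs
    logits = xenc @ W                                       # one linear layer
    loss = F.cross_entropy(logits, ys)                      # softmax + negative log-likelihood
    W.grad = None
    loss.backward()
    W.data += -10.0 * W.grad                                # plain gradient descent
print(f'final loss: {loss.item():.4f}')

Note that F.cross_entropy applies the softmax internally, which is where the "softmax to get the prob dist" step lives.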
-
DL study – Feb 3, 2024
Building Makemore. The No Free Lunch Theorem and OOD generalization. I was wondering whether the NFL theorem is an implication of the impossibility of OOD generalization. The NFL theorem talks about the impossibility of a universal learning algorithm, which sounds closely related to the goal of out-of-distribution generalization. But I think there are nuances that make…
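For reference, the supervised-learning version of the theorem being compared against is usually stated roughly as follows (my paraphrase of Wolpert's result, not part of the original note): averaged uniformly over all possible target functions f, any two learning algorithms A and B have the same expected off-training-set error,

\[
\sum_{f} \mathbb{E}\!\left[\mathrm{err}_{\text{OTS}}(A \mid f, d)\right]
= \sum_{f} \mathbb{E}\!\left[\mathrm{err}_{\text{OTS}}(B \mid f, d)\right]
\quad \text{for any fixed training set } d,
\]

so any advantage one algorithm gains on some class of targets is paid for on the complement of that class.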
-
Generalization and the Scaling laws – Jan 31, 2024
Traditionally, the central challenge in ML is generalization, or out-of-distribution generalization. The problem of generalization only arises when the model is used to make predictions on previously unobserved inputs. However, with the current approach to LLM training, which uses the “entire Internet”, there doesn’t seem to be any unobserved data left and therefore, OOD…
-
DL and intelligence – Jan 28, 2024
This talk was phenomenally interesting; here are some notes:
-
DL study – Jan 25, 2024
The scaling law I have been hearing about: how much truth is there to it? Looked at from the opposite angle, the question is: how much does neural architecture matter? Or are all those invariances and inductive biases that different architectures possess simply a “shortcut” that facilitates more complex problem solving with…
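For concreteness, the "scaling law" usually refers to empirical power-law fits of loss against model size and data, e.g. the parametric form used by Hoffmann et al.; the symbols below are the standard ones from that literature, not from this note:

\[
L(N, D) \approx E + \frac{A}{N^{\alpha}} + \frac{B}{D^{\beta}},
\]

where N is the number of parameters, D the number of training tokens, and E, A, B, alpha, beta are fitted constants. Notably, the fit contains no architecture term, which is exactly what the question above is probing.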