Gang Fang's Blog

System 2 AI, Improv Dancing and {My Curiosity}.

    • About
  • DL implementation study – Feb 16, 2024

    makemore repo with the completed bigram model (lecture 2): https://github.com/gangfang/makemore Notes: Questions:

    gfang1212

    February 16, 2024
    DL skill training
  • DL implementation study – Jan 5, 2024

    building makemore before part 2: # Doing the same thing but with a ANN # imagine how the model looks like before watching video: # input is a char, output is the next # one-hot encoding to make each output neuron a bin classifier # softmax to get the prob dist # N is not…

    gfang1212

    February 5, 2024
    DL skill training, Uncategorized
  • DL study – Feb 4, 2024

    Notes

    gfang1212

    February 5, 2024
    AI
  • DL study – Feb 3, 2024

    Building Makemore The No Free Lunch Theorem and OOD generalization I was wondering if The NFL Theorem is an implication of the impossibility of OOD generalization. NFL theorem talks about the impossibility of creating a universal learning algorithm, which sounds similar to the goal of out-of-dist generalization. But I think there are nuances that make…

    gfang1212

    February 3, 2024
    AI
  • Generalization and the Scaling laws – Jan 31, 2024

    Traditionally, the central challenge in ML is generalization, or out of distribution generalization. The problem of generalization only occurs when the model is used to predict on previously unobserved inputs. However, with the current approach of LLM training, which uses the “entire Internet”, there doesn’t seem to be any more unobserved data and therefore, OOD…

    gfang1212

    January 31, 2024
    System 2 AI
  • DL and intelligence – Jan 28, 2024

    This talk has been phenomenally interesting and here are some notes:

    gfang1212

    January 28, 2024
    AI
  • DL study – Jan 27, 2024

    Building makemore Lecture: https://youtu.be/-u_5ukgYyhg?si=i2mXgp-U95shYtUz

    gfang1212

    January 28, 2024
    AI, System 2 AI
  • DL implementation study – Jan 26, 2024

    Building makemore bigram

    gfang1212

    January 26, 2024
    DL skill training
  • DL study – Jan 25, 2024

    The scaling law I have been hearing about, how much truth is there to it? I think to look at it from the opposite angle, the question is: how much neural architecture matters? Or does all those invariants or biases that different architectures possess are simply a “shortcut”, which facilitates more complex problem solving with…

    gfang1212

    January 25, 2024
    AI, Cognitive Science, DL skill training, System 2 AI
  • DL implementation study – Jan 24, 2024

    Completed micrograd exercise: https://github.com/gangfang/micrograd/blob/main/micrograd_exercises.ipynb

    gfang1212

    January 24, 2024
    DL skill training
Previous Page
1 2 3 4 5 6 … 13
Next Page

Blog at WordPress.com.

  • Subscribe Subscribed
    • Gang Fang's Blog
    • Already have a WordPress.com account? Log in now.
    • Gang Fang's Blog
    • Subscribe Subscribed
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar