skip to content
amar jay
Dark Theme

Posts

  • Understanding AdaNorm

    Understanding Adaptive Layer Normalization. First introduced in the DiT paper
  • Video Codecs

    What are video codes, their evolution, and how to work with them
  • Understanding Squared attention

    Just a brief explanation of how attention mechanism works. As well as the quadratic scaling of attention.
  • Unfulfilled Dream

    A reflective poem about identity, the tension between dreams and reality, and the struggle between belief and self
  • FPGAs

    FPGAs: the ultimate flex by Jon Y from Asianometry
  • C Style

    C code style by Malcolm Inglis
  • Layernorm - Karpathy

    layer normalization of GPT by Andrej Karpathy
  • Layernorm

    layer normalization of GPT by Andrej Karpathy
  • Data Corpus of GPT-3 Training

    Understanding the Text Corpus and Training Datasets of GPT-3
  • Decoder Transformer

    How I understand the Decoder Transformer in Generative Text Models