LLMs and a possible future for Search
The recent surge of Generative AI has pushed the boundaries of what was previously thought possible.
Dec 22, 2022

OpenAI and the road to text-guided image generation: DALL·E, CLIP, GLIDE, DALL·E 2 (unCLIP)
Let’s look at the evolution of text-guided image generation models from OpenAI, as there are more data points than just DALL·E and DALL·E 2.
May 1, 2022

Foundation Models
In August 2021 Stanford announced the establishment of the Center for Research on Foundation Models (CRFM) as part of the Stanford Institute for…
Nov 22, 2021

Hardware for Deep Learning. Part 4: ASIC
This is the part about ASICs from the “Hardware for Deep Learning” series. The content of the series is here.
Jan 12, 2021

JAX
JAX by Google Research is getting more and more popular. DeepMind recently announced they are using JAX to accelerate their research and…
Dec 20, 2020

GPT-3: Language Models are Few-Shot Learners
OpenAI just published a paper “Language Models are Few-Shot Learners” presenting a recent upgrade of their well-known GPT-2 model — the…
Jun 2, 2020

FP64, FP32, FP16, BFLOAT16, TF32, and other members of the ZOO
There are many floating point formats you can hear about in the context of deep learning. Here is a summary of what they are about and…
May 16, 2020

Adaptive Computation Time (ACT) in Neural Networks [3/3]
Part 3: ACT in Transformers
Jan 7, 2020

Adaptive Computation Time (ACT) in Neural Networks [2/3]
Part 2: ACT in Residual Networks
Jan 3, 2020