Megatron

5 Posts

Chipmaker Boosts AI as a Service: Nvidia Launches Cloud Service for NLP Models

Nvidia, known for chips designed to run AI workloads, is providing access to large language models. The company announced early access to NeMo LLM and BioNeMo, cloud-computing services that enable developers to generate text and biological sequences, respectively.

Yoav Shoham: Language models that reason

I believe that natural language processing in 2022 will re-embrace symbolic reasoning, harmonizing it with the statistical operation of modern neural networks. Let me explain what I mean by this.

Trillions of Parameters: Are AI models with trillions of parameters the new normal?

The trend toward ever-larger models crossed the threshold from immense to ginormous. Google kicked off 2021 with Switch Transformer, the first published work to exceed a trillion parameters, weighing in at 1.6 trillion.

Large Language Models Shrink: Gopher and RETRO prove lean language models can push boundaries.

DeepMind released three papers that push the boundaries — and examine the issues — of large language models.

Bigger is Better: A research summary of Microsoft's Turing-NLG language model.

Natural language processing lately has come to resemble an arms race, as the big AI companies build models that encompass ever larger numbers of parameters. Microsoft recently held the record — but not for long.
