Researchers from Princeton and Meta AI Introduce ‘Lory’: A Fully-Differentiable MoE Model Designed for Autoregressive Language Model Pre-Training

Mixture-of-experts (MoE) architectures use sparse activation to scale model size while preserving high training and inference efficiency. However, despite this efficient scaling, training the router network requires optimizing a non-differentiable, discrete objective. Recently, an MoE architecture called SMEAR was introduced, which is fully differentiable and softly merges experts in the parameter space. SMEAR is very efficient, but its effectiveness is limited to small-scale fine-tuning experiments on downstream classification tasks.
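
To make the idea of merging experts in parameter space concrete, here is a minimal sketch in plain Python. It is not the SMEAR or Lory implementation. The function names, the toy 2x2 expert matrices, and the router scores are all assumptions chosen for illustration. Instead of picking one expert with a discrete choice, the router's softmax probabilities are used to average the experts' weights into a single merged expert, so every step stays differentiable.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of router scores.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def merge_experts(expert_weights, router_scores):
    # Soft merging: a probability-weighted average of the expert weight
    # matrices, rather than a discrete (non-differentiable) top-k choice.
    probs = softmax(router_scores)
    rows = len(expert_weights[0])
    cols = len(expert_weights[0][0])
    merged = [[0.0] * cols for _ in range(rows)]
    for p, w in zip(probs, expert_weights):
        for i in range(rows):
            for j in range(cols):
                merged[i][j] += p * w[i][j]
    return merged, probs

def apply_linear(weight, x):
    # Matrix-vector product: run the input through the merged expert.
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in weight]

# Two hypothetical experts: an identity map and a coordinate swap.
experts = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.0, 1.0], [1.0, 0.0]],
]
router_scores = [0.0, 0.0]  # equal scores give equal routing weights

merged, probs = merge_experts(experts, router_scores)
output = apply_linear(merged, [2.0, 4.0])
print(probs, merged, output)
```

With equal router scores, each expert contributes half of the merged weights, and only one merged expert is applied to the input, which is where the efficiency of this style of routing comes from.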

Nvidia Blackwell GB200

Introducing the NVIDIA Blackwell Platform: Unveiling the B200, the Flagship AI Chip, for Pioneering Computing and Generative AI

GTC: Powering a new era of computing, NVIDIA announced on Monday, March 18, 2024, that the NVIDIA Blackwell platform has arrived, enabling organizations everywhere to build and run real-time generative AI on trillion-parameter large language models at up to 25x less cost and energy consumption than its predecessor.

Generative AI

Unlocking Enterprise Success: 10 Impactful Use Cases of NLP Generative AI

In a world increasingly dominated by artificial intelligence (AI) and the promise of groundbreaking applications like ChatGPT, enterprises are seeking concrete ways to harness AI’s potential for tangible benefits. Through our extensive collaborations with leading technology consulting firms and direct interactions with businesses, we’ve pinpointed 10 Natural Language Processing (NLP) and generative AI use cases that not only resolve longstanding organizational challenges but are also exceptionally well-suited for AI solutions, given today’s cutting-edge technology.

OpenAI

OpenAI’s Residency Program: Bridging Minds for AI Advancement

Artificial intelligence has been transforming the way we live and work, and OpenAI, a renowned AI research and deployment company, is at the forefront of this revolution. They understand that to create AI systems that truly benefit humanity, they need a diverse set of skills and backgrounds reflecting the human experience. To achieve this, OpenAI has launched its Residency Program, offering a unique opportunity for exceptional engineers and researchers from various fields to embark on a six-month journey into the world of AI.
