Hugging Face's New LLM Leaderboard Sets Higher Bar for Language Models
Happy Friday, Team!
June 28 is known for the signing of the Treaty of Versailles in 1919, ending World War I. As we reflect on historical milestones, today's focus shifts to the future of technology. Hugging Face has launched a new LLM leaderboard, setting a higher standard for evaluating AI performance. Let's explore the latest breakthroughs in AI shaping tomorrow's world.
TRACK HIGHLIGHTS
Article Takeaways
• Hugging Face's New LLM Leaderboard
• Google Translate's AI-Powered Leap: 110 New Languages Added
• Goldman Sachs: AI Bubble Warnings
• Al Michaels for Personalized Olympic Recaps
• Character.AI Launches AI Avatar Calls
MARKET PITSTOP
A Quick Look at the Market
*Stock Market prices as well as Cryptocurrency prices and data are as of 3:00pm CST of the previous Stock Market trading day.
Stock Market: S&P 500 closes little changed as investors brace for Fed’s preferred inflation gauge. Read More
Crypto: Bitcoin (BTC) options worth $6.68 billion and Ethereum (ETH) options worth $3.5 billion are slated to expire on Deribit, the leading crypto derivatives exchange, this Friday. Read More
LAP 1 Featured Story
Hugging Face's New LLM Leaderboard Sets Higher Bar for Language Models
• Hugging Face launches a new LLM leaderboard.
• Alibaba's Qwen models secure top positions.
• Tests cover knowledge, reasoning, math, and instruction.
• Closed-source models excluded for reproducibility.
Image via Shutterstock
Hugging Face has released its second LLM leaderboard, aiming to establish a more challenging standard for evaluating large language models across diverse tasks. Alibaba's Qwen models dominate the inaugural rankings, securing three of the top ten spots. The new leaderboard tests models on knowledge, reasoning, complex math, and instruction following, using benchmarks such as solving murder mysteries and explaining PhD-level concepts in simple terms.
Notably, closed-source models like ChatGPT are excluded to ensure reproducibility, and the tests are conducted on Hugging Face's own computers powered by 300 Nvidia H100 GPUs. This initiative seeks to address the issue of over-training on previous benchmarks, which has led to a decline in real-world performance for some models. Read More
LAP 2
Google Translate's AI-Powered Leap: 110 New Languages Added
• Google Translate adds 110 new languages using AI
• PaLM 2 enhances efficiency by learning related languages
• 614 million speakers now have support in Translate
Image via Getty Images
Google has expanded Google Translate to include 110 new languages, reaching 614 million speakers worldwide. This expansion is facilitated by their large language model, PaLM 2, which efficiently learns languages that are closely related to each other. Notably, around a quarter of the newly added languages come from Africa. This addition follows the Zero-Shot Machine Translation update from May 2022 and aligns with Google's 1,000 Languages Initiative. Google aims to continually enhance language support through advanced technology and collaborations with linguists and native speakers. Read More
BRAIN TEASER
In 1990, a person is 15 years old. In 1995, that same person is 10 years old. How can this be?
WALLSTREET PITSTOP
Goldman Sachs: AI Bubble Warnings
• Goldman Sachs warns of AI bubble, but supports Nvidia and AI infrastructure investments
• Analyst Jim Covello highlights skepticism about AI's ability to solve critical business problems
• Nvidia stock surges, driven by AI demand from cloud computing giants and internet companies
Image via Getty Images
Goldman Sachs' top analyst Jim Covello cautions about a potential AI bubble but sees value in investing in Nvidia and AI infrastructure companies. Despite concerns about AI's ability to solve business problems and the high investment costs, Covello believes AI infrastructure providers remain a good investment. Nvidia, a key player in AI, has seen substantial stock growth driven by demand from cloud computing and internet companies. However, Covello warns that investor sentiment could shift if significant AI applications don't emerge soon. Read More
LAP 3
Al Michaels for Personalized Olympic Recaps
• NBC to use AI to replicate Al Michaels' voice for Olympic recaps
• Personalized recaps on Peacock will feature "high-quality" AI narration
• Nearly 7 million personalized recap variants available during the Olympics
Image via Marcio Jose Sanchez / AP
NBC will use artificial intelligence to replicate Al Michaels' voice for the 2024 Paris Olympics, providing personalized daily recaps on Peacock. This innovative feature will offer nearly seven million possible recap variants, tailored to users' interests and narrated by a "high-quality" AI version of Michaels' voice. Initially skeptical, Michaels was impressed by the AI's accuracy, describing it as almost perfect. This new feature will be available on the Peacock app starting July 27, one day after the opening ceremonies. Read More
FUN FACT STOP
Dead skin cells are a main ingredient in household dust.
LAP 4
Character.AI Launches AI Avatar Calls
• Character.AI introduces AI avatar calls in multiple languages
• Users can make calls for language practice, mock interviews, and role-playing games
• Seamless switching between calling and texting with AI characters, featuring reduced latency and interruption option
Image via Character.AI
Character.AI has unveiled a new feature allowing users to engage in calls with AI avatars, supporting languages such as English, Spanish, Portuguese, Russian, Korean, Japanese, and Chinese. This capability, tested with over 3 million users and 20 million calls prior to its public launch, supports language practice and mock interviews and integrates with role-playing games. Users can switch between calling and texting, with reduced latency and the ability to interrupt AI responses with a single tap. Read More
FINAL LAP - FINANCE & THE STOCK MARKET
China Caps Finance Pay, Amazon Embraces AI, U.S. Stocks Dominate, Top City for Finance Workers Revealed
• China’s Finance Elite Face $400,000 Pay Cap, Bonus Clawbacks
• Amazon Leans Into Generative AI to Manage Its Finances
• American Stocks Are Consuming Global Markets
• The Best U.S. City for Finance Workers to Live In
BRAIN TEASER ANSWER
The person was born in 2005 BC.
THE RACE CONCLUDES
Thank You for Joining us on the AI Race! 🏁
We want to THANK YOU for joining us on this exciting journey into the world of AI! Our mission is simple: to keep those who feel they have fallen behind the AI curve up to date.
Your subscription marks the beginning of a collaborative exploration into the boundless possibilities of Artificial Intelligence.
Warm regards,
AI Pitstop Team
Help us enhance your AI Pitstop experience! Share your feedback here! [email protected]