Learn more about LLMs & more at ► [ Ссылка ] to get started for free for 30 days, and to get 20% off an annual premium subscription!
In this video we're going to answer just how good Large Language Models (LLMs) like ChatGPT 4o, Claude 3.5, and Google's Gemini are at mathematics. I'll cite some of the results from the literature using databases such as GSM8k and MATH, and we'll see several math examples along the way. References below.
0:00 How to measure AI at math?
0:56 GSM8k and GSM-Hard
2:44 The MATH Database
4:43 ChatGPT 4o vs Gemini vs Claude 3.5 Sonnet
6:13 My Linear Algebra Exams
8:32 Computational Engines
10:34 Brilliant.org/TreforBazett
References and Citations:
*GSM8k (including graphic at 1:10 ) [ Ссылка ]
*GSM-Hard stats found in here: [ Ссылка ]
*Google Deepmind paper citing MATH database: [ Ссылка ]
*I first saw the question about the smallest integer here: [ Ссылка ]
*Math Olympiad level problems (5:30): [ Ссылка ]
*Stats for Claude 3.5: [ Ссылка ]
*Image of two calculators at 2:30 shared via CC-BY-SA 3 original here: [ Ссылка ]
BECOME A MEMBER:
►Join: [ Ссылка ]
MATH BOOKS I LOVE (affilliate link):
► [ Ссылка ]
COURSE PLAYLISTS:
►DISCRETE MATH: [ Ссылка ]
►LINEAR ALGEBRA: [ Ссылка ]
►CALCULUS I: [ Ссылка ]
► CALCULUS II: [ Ссылка ]
►MULTIVARIABLE CALCULUS (Calc III): [ Ссылка ]
►VECTOR CALCULUS (Calc IV) [ Ссылка ]
►DIFFERENTIAL EQUATIONS: [ Ссылка ]
►LAPLACE TRANSFORM: [ Ссылка ]
►GAME THEORY: [ Ссылка ]
OTHER PLAYLISTS:
► Learning Math Series
[ Ссылка ]
►Cool Math Series:
[ Ссылка ]
SOCIALS:
►X/Twitter: [ Ссылка ]
►TikTok: [ Ссылка ]
►Instagram (photography based): [ Ссылка ]
ChatGPT is destroying my math exams
Теги
mathchatgptchatgpt 4olarge language modelLLMhow good is chatgpt at mathclaude 3.5claude sonnetgoogle geminigeminigsm8kmath databaseAI researchhow to solve math problems in chatgpthow to solve maths problems in chatgptchatgpt math problemsbest chatgpt for mathfunny questions to ask chatgpthow chatgpt changed society foreverIs chatgpt good at math?generative AI