What follows is just my interpretation after reading the paper you linked. The paper says that using the infinity norm makes the algorithm surprisingly stable. The update rule is u_t = max(β₂ · u_{t-1}, |g_t|), so g_t is completely ignored whenever |g_t| < β₂ · u_{t-1}, not only when it's close to zero. Because each step keeps the running max rather than averaging, u_1, u_2, ..., u_n are influenced by fewer gradients, and this is what makes the algorithm more robust to noise in the gradients.
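To make that concrete, here's a minimal sketch of how I read that update rule (the function name and the β₂ = 0.999 value are just my choices for illustration, not from the paper):

```python
def infnorm_update(u_prev, g_t, beta2=0.999):
    """One step of the max-based (infinity-norm) accumulator:
    keep a decayed running max of gradient magnitudes."""
    return max(beta2 * u_prev, abs(g_t))

# A tiny gradient is ignored: the decayed previous max wins.
u = infnorm_update(1.0, 1e-6)   # max(0.999, 1e-6) -> 0.999

# A large gradient takes over immediately.
u = infnorm_update(u, 2.0)      # max(~0.998, 2.0) -> 2.0
```

The point being: small gradients (noise) can't drag the accumulator down the way they would in a squared-average, which is how I understand the stability claim.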
Source: Discussion on r/MachineLearning




