y combinator

30 articles about y combinator in AI news

YC Startup Aviary Launches Autonomous AI Agent for Outbound Sales

Aviary, a Y Combinator startup, has launched an AI agent designed to run a company's entire outbound sales process autonomously. This represents a significant push toward fully automated, agentic workflows in enterprise SaaS.

Apr 9, 202697% relevant

YC-Backed Ava Raises $36M for Fully Autonomous AI Sales Rep

Ava, a Y Combinator startup, has raised $36 million to develop an AI 'employee' that runs entire outbound sales processes autonomously. The system aims to replace human sales development representatives (SDRs).

Apr 9, 202685% relevant

YC Removes AI Startup Delve from Website After Allegations of Open Source License Stripping

Y Combinator scrubbed AI startup Delve from its portfolio site after public allegations that the company removed open source licenses from tools and sold them as proprietary software, including from its own customer.

Apr 4, 202685% relevant

OctaPulse Brings AI Robotics to Aquaculture, Starting with Automated Fish Inspection

OctaPulse, a Y Combinator-backed startup, is deploying robotics and computer vision to automate fish inspection in aquaculture. Their system aims to replace manual sampling methods, reduce fish stress, and provide real-time data for better farming decisions.

Mar 2, 202682% relevant

The Missing Manager: How Trace's $3M Bet Aims to Bridge the AI Agent Adoption Gap

Trace, a Y Combinator-backed startup, has raised $3 million to solve enterprise AI agent adoption by providing critical workflow context. The company positions itself as the essential 'manager' layer that orchestrates complex corporate processes, addressing reliability and scalability hurdles that have slowed widespread deployment.

Feb 26, 202670% relevant

AI Research Suggests Whale 'Vowels' in Sperm Whale Communication

AI researchers analyzing sperm whale vocalizations have identified combinatorial structures that function like vowels, marking a step toward decoding cetacean communication.

Apr 15, 202685% relevant

λ-RLM: 8B Parameter Model Using Typed λ-Calculus Beats 405B Performance on Long-Context Tasks

Researchers developed λ-RLM, an 8B parameter model that outperforms 405B models on long-context tasks by replacing recursive code with typed λ-calculus combinators. This approach guarantees termination and reduces latency by up to 4.1x.

Mar 24, 202699% relevant

A Novel Hybrid Heuristic-Reinforcement Learning Framework for Complex Railcar Shunting Problems

Researchers propose a hybrid AI framework combining domain-specific heuristics with Q-learning to optimize the complex, combinatorial problem of railcar shunting in freight yards. The method efficiently handles two-sided track access and multiple locomotives.

Mar 9, 202675% relevant

NanoEuler: GPT-2-Scale 116M Model Built in Pure C/CUDA From Scratch

NanoEuler is a 116M-parameter GPT-2-scale model built in pure C/CUDA from scratch. It provides a complete educational training pipeline for understanding LLMs at the lowest level.

Jun 28, 202675% relevant

Claude Code vs. Codex: Real-World Devs Reveal When Each Tool Wins

Claude Code shines at design and greenfield work; pair with Codex for bug fixes. Use CLAUDE.md for guidance.

Jun 20, 202690% relevant

Claude Code Generates Production Lottie Animations via Show HN

Claude Code claimed to generate production Lottie animations via Show HN. No demo or code published; 2 points, 0 comments. Unverified.

Jun 8, 202675% relevant

Claude Code Quality Drops Post-4.6, Users Report 25% Task Failure Rate

Claude Code quality dropped post-4.6 with ~25% instruction misses. Codex offers 95% reliability but less creativity.

Jun 3, 202690% relevant

Claude Code Token Costs Got You Down? Here's How to Cut Usage 40% Without

Claude Code users frustrated by token costs should use /compact, optimize CLAUDE.md, and route cheap models via OpenRouter for simple tasks—no local model matches Claude's quality yet.

Jun 3, 202690% relevant

Stop Building Interfaces: How YC Runs Finance

Claude Code's agent registry model lets non-coders run operations via deterministic tools — YC's 350+ tool ecosystem proves it. Shift from building interfaces to building small tools.

Jun 2, 202665% relevant

Naïve Launches AI Employees That Form LLCs, Open Bank Accounts

YC-backed Naïve launched AI employees that form LLCs and open bank accounts without human oversight.

May 21, 202688% relevant

GBrain: Garry Tan's Agent Memory Uses Markdown as System of Record

GBrain is Garry Tan's agent memory system using markdown as the system of record, with a self-wiring knowledge graph and overnight dream cycle.

May 12, 202682% relevant

Onyx: Open-Source AI Enterprise Search Challenges Glean's $7.2B Valuation

Open-source platform Onyx provides self-hosted AI enterprise search connecting to 40+ tools, offering a free alternative to Glean's $50/user/month SaaS. Backed by YC and $10M seed funding, it's used by Netflix and Ramp.

Apr 22, 202685% relevant

Researchers Achieve Ultra-Long-Horizon Agentic Science with Cohesive AI Agents

A research team has developed AI agents capable of executing and maintaining coherent, long-horizon scientific research workflows. This addresses a core challenge in creating autonomous systems for complex discovery.

Apr 20, 202685% relevant

MiniMax AI Powers Wati's Astra Voice 2.0 for WhatsApp Business

MiniMax AI is providing its voice technology to power Wati's Astra Voice 2.0 platform, enabling businesses to deploy conversational voice AI on WhatsApp in multiple languages.

Apr 16, 202685% relevant

New Research Proposes Unified LLM Framework for Need-Driven Service

A new arXiv paper introduces a large language model framework that unifies living need prediction and service recommendation for local life services. It uses behavioral clustering to filter noise and a curriculum learning + RL strategy to navigate complex decision paths. Experiments show it significantly improves both need prediction and recommendation accuracy.

Apr 16, 202682% relevant

Dual-Enhancement Product Bundling

Researchers propose a dual-enhancement method for product bundling that integrates interactive graph learning with LLM-based semantic understanding. Their graph-to-text paradigm with Dynamic Concept Binding Mechanism addresses cold-start problems and graph comprehension limitations, showing significant performance gains on benchmarks.

Apr 16, 202671% relevant

AI Struggles with Outlier Ideas as Execution Costs Plummet

As AI drastically lowers the cost of executing ideas, its weakness in generating truly novel, outlier concepts makes exceptional human creativity more valuable than ever.

Apr 12, 202675% relevant

Kuaishou's Dual-Rerank: A New Industrial Framework for High-Stakes

Researchers from Kuaishou introduce Dual-Rerank, a framework designed for industrial-scale generative reranking. It addresses the dual dilemma of structural trade-offs (AR vs. NAR models) and optimization gaps (SL vs. RL) through Sequential Knowledge Distillation and List-wise Decoupled Reranking Optimization. A/B tests on production traffic show significant improvements in user satisfaction and watch time with reduced latency.

Apr 10, 202682% relevant

OpenAI Solves Five Erdős Problems with Internal AI Model

OpenAI researchers have reportedly solved five additional unsolved Erdős problems using an internal AI model. This demonstrates significant progress in AI's ability to tackle complex, open-ended mathematical reasoning.

Apr 9, 202695% relevant

Composio Launches Secure Tool Platform to Replace AI Agent Credential Sharing

Composio announced a platform that lets AI agents use external tools without credential sharing, aiming to solve a major security and operational headache for developers.

Apr 7, 202691% relevant

FLAME: A Novel Framework for Efficient, High-Performance Sequential Recommendation

A new paper introduces FLAME, a training framework for sequential recommender systems. It uses a frozen 'anchor' network and a learnable network, combined via modular ensembles, to capture user behavior diversity efficiently. The result is a single model that performs like an ensemble but runs as fast as a single model at inference.

Apr 7, 202682% relevant

Google DeepMind Maps Six 'AI Agent Traps' That Can Hijack Autonomous Systems in the Wild

Google DeepMind has published a framework identifying six categories of 'traps'—from hidden web instructions to poisoned memory—that can exploit autonomous AI agents. This research provides the first systematic taxonomy for a growing attack surface as agents gain web access and tool-use capabilities.

Apr 1, 202695% relevant

OpenAI Internal Model Reportedly Solves Three New Erdős Problems, Marking AI Advance in Pure Mathematics

An internal AI model at OpenAI has reportedly solved three previously unsolved mathematical problems from the Erdős collection. This development signals a potential leap in AI's capacity for abstract reasoning and formal theorem proving.

Apr 1, 202685% relevant

Garry Tan's gstack: Install This 56k-Star 'Virtual Team' for Claude Code

YC CEO Garry Tan open-sourced gstack, a pack of slash commands that turns Claude Code into a structured team of specialists, claiming it helps ship 10k-20k lines of code daily.

Mar 30, 202699% relevant

VMLOps Publishes Comprehensive RAG Techniques Catalog: 34 Methods for Retrieval-Augmented Generation

VMLOps has released a structured catalog documenting 34 distinct techniques for improving Retrieval-Augmented Generation (RAG) systems. The resource provides practitioners with a systematic reference for optimizing retrieval, generation, and hybrid pipelines.

Mar 27, 202685% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety