tabular data

15 articles about tabular data in AI news

From BM25 to Corrective RAG: A Benchmark Study Challenges the Dominance of Semantic Search for Tabular Data

A systematic benchmark of 10 RAG retrieval strategies on a financial QA dataset reveals that a two-stage hybrid + reranking pipeline performs best. Crucially, the classic BM25 algorithm outperformed modern dense retrieval models, challenging a core assumption in semantic search. The findings provide actionable, cost-aware guidance for building retrieval systems over heterogeneous documents.

Apr 3, 2026 · 82% relevant

From Garbage to Gold: A Theoretical Framework for Robust Tabular ML in Enterprise Data

New research challenges the 'Garbage In, Garbage Out' paradigm, proving that high-dimensional, error-prone tabular data can yield robust predictions through proper data architecture. This has profound implications for enterprise AI deployment.

Mar 16, 2026 · 74% relevant

Bridging Data Worlds: How MultiModalPFN Unifies Tabular, Image, and Text Analysis

Researchers have developed MultiModalPFN, an AI framework that extends TabPFN to handle tabular data alongside images and text. This breakthrough addresses a critical limitation in foundation models for structured data, enabling more comprehensive analysis in healthcare, marketing, and other domains where multiple data types coexist.

Feb 25, 2026 · 72% relevant

Evolving Demonstration Optimization: A New Framework for LLM-Driven Feature Transformation

Researchers propose a novel framework that uses reinforcement learning and an evolving experience library to optimize LLM prompts for feature transformation tasks. The method outperforms classical and static LLM approaches on tabular data benchmarks.

Mar 12, 2026 · 70% relevant

LLM Schema-Adaptive Method Enables Zero-Shot EHR Transfer

Researchers propose Schema-Adaptive Tabular Representation Learning, an LLM-driven method that transforms structured variables into semantic statements. It enables zero-shot alignment across unseen EHR schemas and outperforms clinical baselines, including neurologists, on dementia diagnosis tasks.

Apr 15, 2026 · 99% relevant

DISCO-TAB: Hierarchical RL Framework Boosts Clinical Data Synthesis by 38.2%, Achieves JSD < 0.01

Researchers propose DISCO-TAB, a reinforcement learning framework that guides a fine-tuned LLM with multi-granular feedback to generate synthetic clinical data. It improves downstream classifier utility by up to 38.2% versus GAN/diffusion baselines and achieves near-perfect statistical fidelity (JSD < 0.01).

Apr 3, 2026 · 98% relevant

Graph Tokenization: A New Method to Apply Transformers to Graph Data

Researchers propose a framework that converts graph-structured data into sequences using reversible serialization and BPE tokenization. This enables standard Transformers like BERT to achieve state-of-the-art results on graph benchmarks, outperforming specialized graph models.

Mar 13, 2026 · 70% relevant

Multimodal Knowledge Graphs Unlock Next-Generation AI Training Data

Researchers have developed MMKG-RDS, a novel framework that synthesizes high-quality reasoning training data by mining multimodal knowledge graphs. The system addresses critical limitations in existing data synthesis methods and improves model reasoning accuracy by 9.2% with minimal training samples.

Mar 2, 2026 · 80% relevant

CausalTimePrior: The Missing Link for AI That Understands Time and Cause

Researchers have introduced CausalTimePrior, a new framework to generate synthetic time series data with known interventions. This breakthrough addresses a critical gap in training AI models to understand causality over time, paving the way for foundation models in time series analysis.

Mar 13, 2026 · 95% relevant

Qwen 2.5 7B Verbalized Confidence Is Epistemically Vacuous, Paper Finds

Qwen 2.5 7B's verbalized confidence stays near-constant (0.856–0.937) even as accuracy ranges from 49% to 75.3%. Combining SHAP with few-shot examples cuts ADS from 1.54 to 0.38 and lifts accuracy to 75.3%.

Jun 19, 2026 · 82% relevant

DrugPlayGround Benchmark Tests LLMs on Drug Discovery Tasks

A new framework called DrugPlayGround provides the first standardized benchmark for evaluating large language models on key drug discovery tasks, including predicting drug-protein interactions and chemical properties. This addresses a critical gap in objectively assessing LLMs' potential to accelerate pharmaceutical research.

Apr 6, 2026 · 95% relevant

The Future of Production ML Is an 'Ugly Hybrid' of Deep Learning, Classic ML, and Rules

A technical article argues that the most effective production machine learning systems are not pure deep learning or classic ML, but pragmatic hybrids combining embeddings, boosted trees, rules, and human review. This reflects a maturing, engineering-first approach to deploying AI.

Mar 29, 2026 · 72% relevant

AI Researchers Crack the Delay Problem: New Algorithm Achieves Optimal Performance in Real-World Reinforcement Learning

Researchers have developed a minimax optimal algorithm for reinforcement learning with delayed state observations, achieving provably optimal regret bounds. This breakthrough addresses a fundamental challenge in real-world AI systems where sensors and processing create unavoidable latency.

Mar 5, 2026 · 75% relevant

MedFeat: How AI is Revolutionizing Medical Feature Engineering with Model-Aware Intelligence

Researchers have developed MedFeat, an innovative framework that combines large language models with clinical expertise to create smarter features for medical predictions. Unlike traditional approaches, MedFeat incorporates model awareness and explainability to generate features that improve accuracy and generalization across healthcare settings.

Mar 4, 2026 · 75% relevant

MIRAGE AI Framework Bridges Critical Gap in Alzheimer's Diagnosis by Synthesizing MRI Insights from Health Records

Researchers have developed MIRAGE, a novel AI framework that uses knowledge graphs to synthesize diagnostic MRI information from electronic health records. By bridging the missing-modality gap, it could potentially revolutionize Alzheimer's disease assessment in resource-limited settings.

Mar 4, 2026 · 75% relevant
