json
30 articles about json in AI news
How Structured JSON Inputs Eliminated Hallucinations in a Fine-Tuned 7B Code Model
A developer fine-tuned a 7B code model on consumer hardware to generate Laravel PHP files. Hallucinations persisted until prompts were replaced with structured JSON specs, which eliminated ambiguous gap-filling errors and reduced debugging time dramatically.
How to Build Interactive Apps with Claude Code Using JSON as a Bridge
A developer built a music maker where Claude generates beats via JSON, showing how to create collaborative AI tools without complex APIs.
Stop Clicking 'Approve': A .claude/settings.json Template for 80% Fewer
A practical guide to configuring Claude Code's permissions file to auto-approve routine development commands, speeding up your workflow without sacrificing safety.
SSL: Structured Skill Language Boosts Skill Discovery MRR to 0.707
Researchers propose SSL, a three-layer typed JSON representation for AI agent skills, replacing unstructured SKILL.md prose. Using an LLM normalizer, SSL improves Skill Discovery MRR from 0.573 to 0.707 and Risk Assessment macro F1 from 0.744 to 0.787 on a newly released 6,184-skill corpus.
DharmaOCR: New Small Language Models Set State-of-the-Art for Structured
A new arXiv preprint presents DharmaOCR, a pair of small language models (7B & 3B params) fine-tuned for structured OCR. They introduce a new benchmark and use Direct Preference Optimization to drastically reduce 'text degeneration'—a key cause of performance failures—while outputting structured JSON. The models claim superior accuracy and lower cost than proprietary APIs.
Claude Code Hooks: How to Auto-Format, Lint, and Test on Every Save
Configure hooks in .claude/settings.json to run prettier, eslint, and tests automatically, ensuring clean code without manual intervention.
BloClaw: New AI4S 'Operating System' Cuts Agent Tool-Calling Errors to 0.2% with XML-Regex Protocol
Researchers introduced BloClaw, a unified operating system for AI-driven scientific discovery that replaces fragile JSON tool-calling with a dual-track XML-Regex protocol, cutting error rates from 17.6% to 0.2%. The system autonomously captures dynamic visualizations and provides a morphing UI, benchmarked across cheminformatics, protein folding, and molecular docking.
What Cursor's 8GB Storage Bloat Teaches Us About Claude Code's Clean Architecture
A deep dive into Cursor's scattered 8GB local storage reveals why Claude Code's ~/.claude/projects/*.jsonl approach is better for developers.
How This Developer Built a Production-Ready RAG System with Claude Code in One Weekend
A developer used Claude Code to create a structured JSON-to-PDF knowledge base with 105 quotes, demonstrating how to build RAG-ready datasets faster than ever.
CCmeter: The Open-Source Dashboard That Reveals Exactly Why Your Claude
CCmeter parses Claude Code's local session logs to surface cache-busting patterns, cost leaks, and model-swap simulations. Free, local-first, zero telemetry.
Microsoft's Playwright MCP Server Replaces Vision for Web Agents
Microsoft built an MCP server for Playwright that lets AI agents interact with web pages using the accessibility tree, eliminating the need for screenshots and vision models. This approach reduces hallucinations and broken selectors, working with tools like Cursor, VS Code, and Claude Desktop.
Gemini App Gets File Creation and Its Own File Directory
The Gemini app now supports file creation and a dedicated file directory, enabling users to work directly within the app. This transforms Gemini from a conversational AI into a more autonomous workspace tool.
Version Sentinel: A Claude Code Plugin That Blocks Hallucinated Package Versions
Version Sentinel uses Claude Code's hook system to intercept dependency changes and require version verification, preventing supply-chain risks from hallucinated package versions.
The 2026 CLAUDE.md Playbook: 8 Rules That Make Your Agent 2x More Effective
The 2026 consensus on CLAUDE.md: shorter files, falsifiable rules, and explicit enforcement. Here's the 8-rule framework to stop your agent from fighting stale configs.
Use Claude Code to Automate Systematic Literature Reviews
Claude Code can automate systematic literature reviews: scrape papers, extract key themes, and generate structured summaries — all from the terminal.
Build Reusable Data Science Workflows with Claude Skills and Subagents
Claude Skills and Subagents let you package prompts into reusable modules, freeing data scientists from repetitive AI adjustments for EDA, modeling, and deployment.
Agent Harnessing: The Infrastructure That Makes AI Agents Work
A detailed technical guide argues that the model is not the hard part of building AI agents. The six-component harness — context management, memory, tools, control flow, verification, and coordination — is what separates production-grade agents from those that fail silently.
How a Nursing Student Used Claude Haiku to Build a 660K-Page Drug Database Solo
Learn how Claude Haiku enabled a solo developer to classify thousands of medical conditions and build a production-grade pharmaceutical database.
Cloudflare Ships Enterprise MCP Governance
Cloudflare's MCP portal aggregates servers behind Cloudflare Access auth, while Code Mode collapses APIs into two tools. But most SaaS MCP endpoints lack controls — here's how to protect your Claude Code workflows.
Doby Cuts Claude Code Navigation Tokens by 95% with Spec-First Workflow
A spec-first fix workflow that slashes navigation tokens 95% and enforces plan docs as source of truth before code changes.
AWS Bedrock's New MCP Tools Are a Game-Changer for Claude Code Users
AWS Bedrock has released new tools for the Model Context Protocol (MCP), enabling developers to build custom servers that connect Claude Code directly to AWS data and services.
Google's Design.md Gives AI Coding Agents a Visual Design Memory
Google introduced Design.md, a file format for storing design tokens and rules that AI coding agents can read to maintain visual consistency, addressing a key failure point in automated UI generation.
Shopify Engineering details 'Flow generation through natural language'
Shopify Engineering describes a 2026 approach to generating complex workflows (flows) from natural language prompts using an agentic modeling framework, enabling non-technical users to create automation.
From Checkout to Trust Layer: How Merchants Can Prepare for Agentic Commerce
The article discusses the evolution of e-commerce from simple checkout processes to a future where AI shopping agents act on behalf of consumers. It argues that success in this 'agentic commerce' era depends on merchants building a robust trust layer with data security, transparency, and reliability at its core.
Stop Losing Agent Context: Implement Session Memory Files in Your Claude
A simple pattern using structured markdown files to persist session state across context windows, preventing Claude Code agents from redoing work or making inconsistent decisions.
Google Open-Sources OSV-Scanner: AI-Powered Dependency Vulnerability Scanner
Google has open-sourced OSV-Scanner, a vulnerability scanner that maps project dependencies against the OSV database across 11+ ecosystems. It features guided remediation and call analysis to reduce false positives.
MCP's 'By Design' Security Flaw
The Model Context Protocol's power comes with risk: servers you install can run code on your system. Learn how to audit and manage MCP server permissions.
10 Claude Code Skills That Actually Work: A Solo Developer's Vetted List
A curated list of the most effective Claude Code skills for developers, based on hands-on testing, focusing on practical MCP servers and workflow enhancements.
From CI Fire to 9% Interruption
Learn the four guardrail patterns and three-phase CLAUDE.md strategy that turns auto-approve from a CI-breaking risk into a productivity superpower.
Why Claude Code's 'Tool Calls' Aren't Hooks — And How to Design for Its
Understanding Claude's 8-step tool pipeline—from edge routing to result injection—is critical for structuring error handling, timeouts, and debugging in production applications.