AI Native Medhavi
NewsMCP DirectorySkillsNewsletterSign In

AI Native Developer News

AI development tools, research, and industry news — clustered and ranked by importance.

24h48hWeekMonth
AllFrontier LabsAI Coding ToolsModelsResearchInfrastructureFrameworksNewsCommunityOpen Source
Improve coding agents’ performance with Gemini API Docs MCP and Agent Skills.

Google has launched two tools to improve the performance of coding agents using outdated Gemini API code. The Gemini API Docs MCP aims to enhance the accuracy of code generation by providing updated documentation access, while the Agent Skills tool focuses on training agents to improve their code output. These developments are intended to address issues stemming from the cutoff date of the training data for these agents, ensuring that they can produce relevant and current code for developers.

Google Developers Blog·just now
ai-coding-toolsai-frameworksfrontier-labs
Visual Studio Code 1.114

The release of Visual Studio Code 1.114 introduces new features that enhance the development experience, particularly for AI developers looking to integrate advanced coding tools into their workflows. This update may streamline the use of AI-assisted development functionalities, potentially improving productivity and efficiency.

VS Code Blog·just now
VS Code Blog
ai-coding-toolsopen-source
datasette-enrichments-llm 0.2a0

The article discusses the release of version 0.2a0 of datasette-enrichments-llm, a tool that enhances Datasette with capabilities powered by large language models (LLMs). This update introduces several new features focused on improving data enrichment and usability within the Datasette framework. Key improvements include enhanced support for structured data and more intuitive interfaces. The update positions datasette-enrichments-llm as a valuable resource for developers seeking to leverage LLMs in their data applications.

Simon Willison·2h ago
Simon Willison
ai-coding-toolsai-frameworksopen-source

Latest

  • Visual Studio Code 1.114
    VS Code Blog-661m ago
  • Improve coding agents’ performance with Gemini API Docs MCP and Agent Skills.
    Google Developers Blog-415m ago
  • Wherefore Art Thou? Provenance-Guided Automatic Online Debugging with Lumos
    arXiv CS.SE1h ago
  • Webscraper: Leverage Multimodal Large Language Models for Index-Content Web Scraping
    arXiv CS.AI1h ago
  • GISTBench: Evaluating LLM User Understanding via Evidence-Based Interest Verification
    arXiv CS.AI

Latest

  • Visual Studio Code 1.114
    VS Code Blog-661m ago
  • Improve coding agents’ performance with Gemini API Docs MCP and Agent Skills.
    Google Developers Blog-415m ago
  • Wherefore Art Thou? Provenance-Guided Automatic Online Debugging with Lumos
    arXiv CS.SE1h ago
  • Webscraper: Leverage Multimodal Large Language Models for Index-Content Web Scraping
    arXiv CS.AI1h ago
  • GISTBench: Evaluating LLM User Understanding via Evidence-Based Interest Verification
    arXiv CS.AI
1h ago
  • SciVisAgentBench: A Benchmark for Evaluating Scientific Data Analysis and Visualization Agents
    arXiv CS.AI1h ago
  • SyriSign: A Parallel Corpus for Arabic Text to Syrian Arabic Sign Language Translation
    arXiv CS.CL1h ago
  • Compiling Code LLMs into Lightweight Executables
    arXiv CS.SE1h ago
  • HackRep: A Large-Scale Dataset of GitHub Hackathon Projects
    arXiv CS.SE1h ago
  • Dual Perspectives in Emotion Attribution: A Generator-Interpreter Framework for Cross-Cultural Analysis of Emotion in LLMs
    arXiv CS.CL1h ago
  • From Consensus to Split Decisions: ABC-Stratified Sentiment in Holocaust Oral Histories
    arXiv CS.CL1h ago
  • Practical Feasibility of Sustainable Software Engineering Tools and Techniques
    arXiv CS.SE1h ago
  • ChartDiff: A Large-Scale Benchmark for Comprehending Pairs of Charts
    arXiv CS.AI1h ago
  • Long-Document QA with Chain-of-Structured-Thought and Fine-Tuned SLMs
    arXiv CS.CL1h ago
  • Concept Training for Human-Aligned Language Models
    arXiv CS.CL1h ago
  • BayesInsights: Modelling Software Delivery and Developer Experience with Bayesian Networks at Bloomberg
    arXiv CS.SE1h ago
  • SkillReducer: Optimizing LLM Agent Skills for Token Efficiency
    arXiv CS.SE1h ago
  • Machine Learning in the Wild: Early Evidence of Non-Compliant ML-Automation in Open-Source Software
    arXiv CS.SE1h ago
  • EcoScratch: Cost-Effective Multimodal Repair for Scratch Using Execution Feedback
    arXiv CS.SE1h ago
  • How and Why Agents Can Identify Bug-Introducing Commits
    arXiv CS.SE1h ago
  • Self-Improving Code Generation via Semantic Entropy and Behavioral Consensus
    arXiv CS.SE1h ago
  • Sustainable AI Assistance Through Digital Sobriety
    arXiv CS.SE1h ago
  • Software Vulnerability Detection Using a Lightweight Graph Neural Network
    arXiv CS.SE1h ago
  • Designing FSMs Specifications from Requirements with GPT 4.0
    arXiv CS.SE1h ago
  • Logging Like Humans for LLMs: Rethinking Logging via Execution and Runtime Feedback
    arXiv CS.SE1h ago
  • Kwame 2.0: Human-in-the-Loop Generative AI Teaching Assistant for Large Scale Online Coding Education in Africa
    arXiv CS.CL1h ago
  • CADEL: A Corpus of Administrative Web Documents for Japanese Entity Linking
    arXiv CS.CL1h ago
  • SiPaKosa: A Comprehensive Corpus of Canonical and Classical Buddhist Texts in Sinhala and Pali
    arXiv CS.CL1h ago
  • MemRerank: Preference Memory for Personalized Product Reranking
    arXiv CS.CL1h ago
  • The Thiomi Dataset: A Large-Scale Multimodal Corpus for Low-Resource African Languages
    arXiv CS.CL1h ago
  • 1h ago
  • SciVisAgentBench: A Benchmark for Evaluating Scientific Data Analysis and Visualization Agents
    arXiv CS.AI1h ago
  • SyriSign: A Parallel Corpus for Arabic Text to Syrian Arabic Sign Language Translation
    arXiv CS.CL1h ago
  • Compiling Code LLMs into Lightweight Executables
    arXiv CS.SE1h ago
  • HackRep: A Large-Scale Dataset of GitHub Hackathon Projects
    arXiv CS.SE1h ago
  • Dual Perspectives in Emotion Attribution: A Generator-Interpreter Framework for Cross-Cultural Analysis of Emotion in LLMs
    arXiv CS.CL1h ago
  • From Consensus to Split Decisions: ABC-Stratified Sentiment in Holocaust Oral Histories
    arXiv CS.CL1h ago
  • Practical Feasibility of Sustainable Software Engineering Tools and Techniques
    arXiv CS.SE1h ago
  • ChartDiff: A Large-Scale Benchmark for Comprehending Pairs of Charts
    arXiv CS.AI1h ago
  • Long-Document QA with Chain-of-Structured-Thought and Fine-Tuned SLMs
    arXiv CS.CL1h ago
  • Concept Training for Human-Aligned Language Models
    arXiv CS.CL1h ago
  • BayesInsights: Modelling Software Delivery and Developer Experience with Bayesian Networks at Bloomberg
    arXiv CS.SE1h ago
  • SkillReducer: Optimizing LLM Agent Skills for Token Efficiency
    arXiv CS.SE1h ago
  • Machine Learning in the Wild: Early Evidence of Non-Compliant ML-Automation in Open-Source Software
    arXiv CS.SE1h ago
  • EcoScratch: Cost-Effective Multimodal Repair for Scratch Using Execution Feedback
    arXiv CS.SE1h ago
  • How and Why Agents Can Identify Bug-Introducing Commits
    arXiv CS.SE1h ago
  • Self-Improving Code Generation via Semantic Entropy and Behavioral Consensus
    arXiv CS.SE1h ago
  • Sustainable AI Assistance Through Digital Sobriety
    arXiv CS.SE1h ago
  • Software Vulnerability Detection Using a Lightweight Graph Neural Network
    arXiv CS.SE1h ago
  • Designing FSMs Specifications from Requirements with GPT 4.0
    arXiv CS.SE1h ago
  • Logging Like Humans for LLMs: Rethinking Logging via Execution and Runtime Feedback
    arXiv CS.SE1h ago
  • Kwame 2.0: Human-in-the-Loop Generative AI Teaching Assistant for Large Scale Online Coding Education in Africa
    arXiv CS.CL1h ago
  • CADEL: A Corpus of Administrative Web Documents for Japanese Entity Linking
    arXiv CS.CL1h ago
  • SiPaKosa: A Comprehensive Corpus of Canonical and Classical Buddhist Texts in Sinhala and Pali
    arXiv CS.CL1h ago
  • MemRerank: Preference Memory for Personalized Product Reranking
    arXiv CS.CL1h ago
  • The Thiomi Dataset: A Large-Scale Multimodal Corpus for Low-Resource African Languages
    arXiv CS.CL1h ago