Research

All articles in the research category.

Task-Seeded Synthetic Data Improved NVIDIA's Model by 11 Points on Hard Science Questions

NVIDIA's research shows that synthetic training data structured around task families—not raw scale—drives targeted capability gains. Their approach improved scientific reasoning by 11 points while keeping math and code performance stable.

Dr. Sana Okafor Jun 4, 2026

research 9 min read

How DPO Cuts Text Degeneration by 59% Without Retraining From Scratch

DharmaOCR's results show that Direct Preference Optimization (DPO) isn't just for chat alignment. Applied after supervised fine-tuning, DPO reduced text degeneration by an average of 59.4% across five vision-language model families, with zero exceptions.

Dr. Sana Okafor Jun 3, 2026

research 7 min read

Every Major AI Model Fails Multi-Turn Attacks: What Cisco's 2026 Research Means for Enterprise Safety

Single-turn safety benchmarks don't predict real-world vulnerability. Cisco's testing of 15 frontier models reveals that iterative attacks succeed up to 88% of the time—even against models that look secure in standard evaluations.

Dr. Sana Okafor Jun 1, 2026

research 7 min read

Frontier AI Models Fail Basic Enterprise IT Tasks: ITBench-AA Benchmark Shows 47% Peak Score in 2026

The first benchmark for agentic enterprise IT tasks reveals an uncomfortable truth: the best AI models score below 50% on real-world site reliability engineering tasks. ITBench-AA, developed by Artificial Analysis and IBM, shows frontier models struggle with Kubernetes incident diagnosis despite excelling at other benchmarks.

Dr. Sana Okafor May 27, 2026

research 7 min read

NVIDIA's Diffusion Language Models Hit 865 Tokens/Second — 6× Faster Than GPT-Style Generation

NVIDIA's new diffusion language models generate multiple tokens in parallel, hitting 865 tokens/second on B200 hardware, roughly 6× faster than traditional autoregressive models. Unlike GPT-style generation, which produces one token at a time, these models draft and refine whole blocks of text in parallel while maintaining accuracy.

Dr. Sana Okafor May 23, 2026

research 9 min read

Text Degeneration in LLMs: The Hidden Production Cost Inflating Inference by 42%

A structural failure mode in autoregressive language models causes fewer than 3% of requests to consume nearly half of total inference time. New research from DharmaOCR shows the problem is built into training objectives—and proposes a fix grounded in the training distribution itself.

Dr. Sana Okafor May 22, 2026

research 7 min read

GPT-5.2 Proves a New Particle Physics Result: What It Means for AI-Assisted Science in 2026

GPT-5.2 identified a pattern human physicists missed, conjectured a formula for gluon scattering amplitudes, then proved it—marking a shift from AI as research tool to AI as research partner. The result challenges a decades-old assumption about particle interactions.

Dr. Sana Okafor Feb 13, 2026