Large Model Programming

Meta’s new Code Llama large language model optimized for programming tasks

Meta Platforms Inc. today introduced Code Llama, an open-source large language model that can automatically generate code snippets and explain how they work. The model is free for commercial use. Code ...

Business Wire

Z.ai Open-Sources GLM-4.7, a New Generation Large Language Model Built for Real Development Workflows

SINGAPORE--(BUSINESS WIRE)--Z.ai released GLM-4.7 ahead of Christmas, marking the latest iteration of its GLM large language model family. As open-source models move beyond chat-based applications and ...

VentureBeat

Mistral announces Codestral, a code-generation LLM it says outperforms all others

Today, Paris-based Mistral, the AI startup that raised Europe’s largest-ever seed round a year ago and has since become a rising star in the global AI domain, marked its entry into the programming and ...

Nature

Large language models help computer programs to evolve

Romera-Paredes and colleagues’ work is the latest step in a long line of research that attempts to create programs automatically by taking inspiration from biological evolution, a field called genetic ...

Global Times

Qwen3.6-Plus tops global usage chart on debut, as new Chinese AI models rank among world’s leading benchmarks

Qwen’s new model, Qwen3.6-Plus, topped the daily rankings on the widely recognized global large-model API platform OpenRouter on Saturday, according to the platform’s official website. The model just ...

The Verge

Meta’s free Code Llama AI programming tool closes the gap with GPT-4

Code Llama 70B can generate and debug larger programming strings than Meta’s previous models. Meta’s ...

InfoWorld

Large language models: The foundations of generative AI

Large language models evolved alongside deep-learning neural networks and are critical to generative AI. Here's a first look, including the top LLMs and what they're used for today. Large language ...

GizChina

Xiaomi Launches MiMo Large Model With TokenPlan Pricing — Starting at 39 Yuan

Xiaomi just entered the AI model API market properly. The MiMo large model is live, and it comes with a tiered subscription system called TokenPlan that converts usage into credits rather than ...

MIT Technology Review

Anthropic can now track the bizarre inner workings of a large language model

What the firm found challenges some basic assumptions about how this technology really works. The AI firm Anthropic has developed a way to peer inside a large language model and watch what it does as ...

TechNode

ByteDance unveils UltraMem architecture to reduce large model inference costs by up to 83%

ByteDance’s Doubao Large Model team yesterday introduced UltraMem, a new architecture designed to address the high memory access issues found during inference in Mixture of Experts (MoE) models.
