Larger models can pull off a wider variety of feats, but the reduced footprint of smaller models makes them attractive tools.
During training, these switches are adjusted to optimize the network’s overall performance in understanding and generating language. More parameters generally make a model more accurate, but models with ...
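As a minimal illustration of how training "adjusts the switches," here is a toy gradient-descent sketch. It fits a single hypothetical parameter to data by repeatedly nudging it to reduce a loss; real model training does the same thing across billions of parameters at once (this is a generic textbook example, not any particular lab's training loop):

```python
# Toy illustration: training adjusts a parameter to reduce a loss.
# Hypothetical one-parameter "model": y = w * x; we fit w to data.

def train(xs, ys, lr=0.01, steps=200):
    w = 0.0  # the single "switch" being adjusted
    for _ in range(steps):
        # Gradient of mean-squared-error loss with respect to w:
        grad = sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)
        w -= lr * grad  # nudge the parameter downhill on the loss surface
    return w

# Data generated by y = 3x, so training should drive w toward 3.
w = train([1, 2, 3, 4], [3, 6, 9, 12])
```

Each extra parameter adds another dimension to this optimization, which is why larger models can fit richer behavior but cost more to train and run.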
But it could be the last release in OpenAI's classic LLM lineup.
The world’s largest contract electronics maker said Monday it has built its own large language model with reasoning ...
Foxconn Technology (OTCPK:FXCOF) launched its first large language model called FoxBrain, with a lower-cost model training ...
Taiwan’s Foxconn said on Monday it has launched its first large language model and plans to use the technology to improve ...
The new small language model can help developers build multimodal AI applications for lightweight computing devices, ...
In the world of large language models (LLMs) ... Its new DeepSeek-V3 model is not only open source, it also claims to have been trained for only a fraction of the effort required by competing ...
A language model’s attention mechanism helps it determine ... is an upgraded version of Phi-4-mini with 5.6 billion parameters. It can process not only text but also images, audio and video.
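The attention mechanism mentioned above can be sketched in a few lines. This is the generic scaled dot-product formulation from the transformer literature, not the actual implementation inside Phi-4-mini or any other named model; the vectors and sizes are hypothetical:

```python
import math

def softmax(scores):
    # Numerically stable softmax: turns raw scores into weights summing to 1.
    exps = [math.exp(s - max(scores)) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(query, keys, values):
    """Scaled dot-product attention for one query over a sequence.

    Each value is weighted by how well its key matches the query,
    which is how the model decides which tokens to 'attend' to.
    """
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
              for key in keys]
    weights = softmax(scores)
    # Weighted sum of the value vectors.
    return [sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))]

# A query aligned with the first key pulls the output toward the first value.
out = attention([1.0, 0.0], [[1.0, 0.0], [0.0, 1.0]], [[10.0, 0.0], [0.0, 10.0]])
```

Multimodal models such as the one described extend this same weighting idea so that text tokens can attend over image, audio, and video representations as well.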
Alibaba Group's release of an artificial intelligence (AI) reasoning model, which it said was on par with global hit DeepSeek ...
The next frontier for large language models (LLMs), one of ... Mistral Saba is a relatively small model with 24 billion parameters. As a reminder, fewer parameters generally lead to better ...