LLM Scale Up - 搜索 News

New LLM developed for under $50 outperforms OpenAI’s o1-preview

“Our model s1-32B exhibits test-time scaling,” the researchers ... The former model achieved scores up to 27% higher than OpenAI’s LLM. In another test that involved math questions, s1-32B ...

scmp.com1 个月

Meet DeepSeek: the Chinese start-up that is changing how AI models are trained

fix broken links or scale up its capabilities. DeepSeek’s development of a powerful LLM at less cost than what bigger companies spend shows how far Chinese AI firms have progressed, despite US ...

VentureBeat19 天

DeepMind’s new inference-time scaling technique improves planning accuracy in LLMs

Learn More Inference-time scaling is one of the big themes of artificial ... The solutions are generated by an LLM that has been given a description of the problem along with useful information ...

12 天

Alibaba launches new AI model that it says outperforms DeepSeek, China's hottest start-up

Alibaba Group Holding released on Wednesday an upgraded version of its Qwen artificial intelligence (AI) model, which it said ...

InfoQ1 个月

Meta Open-Sources Byte Latent Transformer LLM with Improved Scalability

Meta open-sourced Byte Latent Transformer (BLT), an LLM architecture that uses a learned ... According to Meta, BLT unlocks a new dimension for scaling, allowing simultaneous increases in model ...

VentureBeat27 天

MiniMax unveils its own open-source LLM with industry-leading 4M token context

The series includes MiniMax-Text-01, a foundation large language model (LLM), and MiniMax-VL-01, a visual multimodal model. MiniMax-Text-o1, is of particular note for enabling up to 4 million ...

SiliconANGLE1 个月

Diffbot boosts LLM accuracy by tapping into its vast Knowledge Graph of up-to-date information

It extracts the most recent information from these sites using natural language processing and compute vision to keep its database up to date ... Diffbot hopes that its LLM will be used by ...

tom's Hardware on MSN3 个月

Ryzen AI 300 takes big wins over Intel in LLM AI performance — up to 27% faster token ...

AMD's Ryzen AI 300 series of mobile processors beats Intel's mobile competition handily at local large language model (LLM) ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果