DeepSeek releases 'sparse attention' model that cuts API costs in half

DeepSeek releases 'sparse attention' model that cuts API costs in half

Researchers at DeepSeek on Monday released a new experimental model called V3.2-exp, designed to have dramatically lower inference costs when used in long-context operations. DeepSeek announced the model with a post on Hugging Face , also posting a linked academic paper on GitHub.

The most important feature of the new model is called DeepSeek Sparse Attention, an intricate system described in detail in the diagram below. In essence, the system uses a module called a “lightning indexer” to prioritize specific excerpts from the context window. After that, a separate system called a “fine-grained token selection system” chooses specific tokens from within those excerpts to load into the module’s limited attention window. Taken together, they allow the Sparse Attention models to operate

See Full Page

Looks like you've reached the bottom

Interests (0)

Settings

DeepSeek releases 'sparse attention' model that cuts API costs in half

AI is transforming how software engineers do their jobs. Just don't call it 'vibe

Why a Waymo is like a horse

Trump's team keeps posting AI portraits of him. We keep clicking

CNA explains: What is transhumanism?

Criminal sentencing of former Portage mayor moved to January

Donald Trump's Jimmy Kimmel Drama Proves He Has The Most Fragile Ego In Presidential History

Protesters clash with ICE as federal troops enter Portland

California Governor Newsom signs landmark AI safety bill SB 53

Frank founder Charlie Javice sentenced to 7 years in prison for defrauding JPMorgan Chase

DeepSeek: Everything you need to know about the AI chatbot app

1 person dead and 9 injured in shooting at Michigan church, police say

Protesters clash with ICE as federal troops enter Portland

'Church is on fire': Mormon house of worship set ablaze amid active shooter attack

Multiple people shot at Mormon church in Michigan and shooter is down, police say

Four Dead, Eight Injured in Michigan Church Shooting

'Engineered incompetence': Why Trump may have picked Lindsey Halligan to oversee the Comey case

Witness recalls Michigan church shooting: "It was the scariest moment of my life"

Dolly Parton Postpones Las Vegas Concerts Due to Health Issues

Arrest made after boater opens fire on North Carolina waterfront bar killing 3, injuring 5

DNA Links Robert Brashers to 1991 Yogurt Shop Murders