Chinese tech giant Alibaba has published a paper detailing the scheduling tech it used to achieve impressive utilization improvements across the GPU fleet that powers its inferencing workloads – which is nice, but not a breakthrough that will worry AI investors.
Titled “Aegaeon: Effective GPU Pooling for Concurrent LLM Serving on the Market”, the paper [PDF] opens by pointing out that model-mart Hugging Face lists over a million AI models, although customers mostly run just a few of them. Alibaba Cloud nonetheless offers many models but found it had to dedicate 17.7 percent of its GPU fleet to serving just 1.35 percent of customer requests.
The reason for that discrepancy is that service providers typically configure their GPUs to run only two or three models, which is all that a GPU's memory can hold at once. The arithmetic is sketched below.
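For a sense of why two or three models is the ceiling, here's a rough back-of-the-envelope sketch. All figures are illustrative assumptions, not numbers from Alibaba's paper: at FP16 precision, weights alone cost two bytes per parameter, and each loaded model also needs headroom for the KV cache its in-flight requests generate.

```python
# Rough sketch (illustrative assumptions, not figures from the Aegaeon paper):
# how many models' weights fit in one GPU's memory at FP16 precision?

GPU_MEMORY_GB = 96          # assumed HBM capacity of a 96 GB-class accelerator
BYTES_PER_PARAM = 2         # FP16/BF16 weights: two bytes per parameter
KV_CACHE_RESERVE_GB = 20    # assumed headroom for KV cache and activations

def weight_footprint_gb(params_billion: float) -> float:
    """Approximate memory needed just to hold a model's weights."""
    return params_billion * 1e9 * BYTES_PER_PARAM / 1e9

usable = GPU_MEMORY_GB - KV_CACHE_RESERVE_GB
for params in (7, 14, 32):
    fits = int(usable // weight_footprint_gb(params))
    print(f"{params}B model: ~{weight_footprint_gb(params):.0f} GB of weights, "
          f"~{fits} fit after cache headroom")
```

On those assumed numbers, a 14B-parameter model's weights alone occupy roughly 28 GB, so two copies all but exhaust a 96 GB card once cache headroom is set aside – about the two-or-three-model ceiling described above.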