#model-optimization

Growth hacking
from InfoQ
1 week ago

Scaling Large Language Model Serving Infrastructure at Meta

LLM serving is evolving into a foundational technology similar to an operating system.
from InfoWorld
10 months ago

All the brilliance of AI on minimalist platforms

Fast forward to 2024: our reliance on massive data infrastructures is evaporating, with AI systems running on palm-sized devices. Apple and Qualcomm chips integrate AI for tasks like language translation and photo processing.
Digital life
#machine-learning
Artificial intelligence
from HackerNoon
3 months ago

Rethinking AI Quantization: The Missing Piece in Model Efficiency | HackerNoon

Quantization strategies optimize LLM precision while balancing accuracy and effectiveness through methods like post-training quantization and quantization-aware training.
Scala
from HackerNoon
3 months ago

The Hidden Power of "Cherry" Parameters in Large Language Models | HackerNoon

Parameter heterogeneity in LLMs shows that a small number of parameters greatly influence performance, leading to the development of the CherryQ quantization method.