Now Available: Optimized DeepSeek-R1 On GMI Cloud

GMI Cloud is excited to announce that we are now hosting DeepSeek-R1 and its distilled models on a dedicated inference endpoint, running on optimized, US-based hardware.

What’s DeepSeek-R1? Read our initial takeaways here.

Technical details:

  • Model Provider: DeepSeek
  • Type: Chat
  • Parameters: 685B
  • Deployment: Serverless (MaaS) or Dedicated Endpoint
  • Quantization: FP16
  • Context Length: 128K tokens (the maximum the model can process within a single session)
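
Once you have access, calling the model should look like any OpenAI-compatible chat API. Here is a minimal sketch of a request; the base URL, API key, and `deepseek-r1` model identifier are placeholder assumptions, so check your GMI Cloud console for the exact values.

```python
# A minimal sketch, assuming an OpenAI-compatible chat completions endpoint.
# The base URL, credential, and model identifier below are placeholders,
# not GMI Cloud's documented values.
from openai import OpenAI

client = OpenAI(
    base_url="https://<your-gmi-endpoint>/v1",  # placeholder endpoint URL
    api_key="<your-api-key>",                   # placeholder credential
)

response = client.chat.completions.create(
    model="deepseek-r1",  # placeholder model identifier
    messages=[
        {"role": "user", "content": "Explain FP16 quantization in one sentence."}
    ],
    max_tokens=512,
)
print(response.choices[0].message.content)
```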

Additionally, we are offering the following distilled models:

  • DeepSeek-R1-Distill-Llama-70B
  • DeepSeek-R1-Distill-Qwen-32B
  • DeepSeek-R1-Distill-Qwen-14B
  • DeepSeek-R1-Distill-Llama-8B
  • DeepSeek-R1-Distill-Qwen-7B
  • DeepSeek-R1-Distill-Qwen-1.5B
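
The same call pattern should work for the distilled models by swapping in the corresponding model identifier. Below is a hedged streaming sketch; the `DeepSeek-R1-Distill-Qwen-7B` identifier string and endpoint details are assumptions, not confirmed values.

```python
# A minimal streaming sketch for a distilled model, again assuming an
# OpenAI-compatible endpoint; identifiers and URLs are placeholders.
from openai import OpenAI

client = OpenAI(
    base_url="https://<your-gmi-endpoint>/v1",  # placeholder endpoint URL
    api_key="<your-api-key>",                   # placeholder credential
)

stream = client.chat.completions.create(
    model="DeepSeek-R1-Distill-Qwen-7B",  # placeholder; check the console for exact names
    messages=[
        {"role": "user", "content": "Summarize chain-of-thought distillation."}
    ],
    stream=True,
)
for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:  # deltas may be None for some chunks
        print(delta, end="", flush=True)
```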

Try our token-free service with unlimited usage!

Reach out for access to our dedicated endpoint here.
