DeepSeek

127 posts
@deepseek_ai
Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
deepseek.com · Joined October 2023

DeepSeek’s posts

Pinned
To prevent any potential harm, we reiterate that this is our sole official account on Twitter/X. Any accounts:
- representing us
- using identical avatars
- using similar names
are impersonations. Please stay vigilant to avoid being misled!
🚀 DeepSeek-R1 is here!
⚡ Performance on par with OpenAI-o1
📖 Fully open-source model & technical report
🏆 MIT licensed: Distill & commercialize freely!
🌐 Website & API are live now! Try DeepThink at chat.deepseek.com today!
🐋 1/n
🚀 Introducing DeepSeek-V3! Biggest leap forward yet:
⚡ 60 tokens/second (3x faster than V2!)
💪 Enhanced capabilities
🛠 API compatibility intact
🌍 Fully open-source models & papers
🐋 1/n
📜 License Update!
🔄 DeepSeek-R1 is now MIT licensed for clear open access
🔓 Open for the community to leverage model weights & outputs
🛠️ API outputs can now be used for fine-tuning & distillation
🐋 3/n
🔥 Bonus: Open-Source Distilled Models!
🔬 Distilled from DeepSeek-R1, 6 small models fully open-sourced
📏 32B & 70B models on par with OpenAI-o1-mini
🤝 Empowering the open-source community
🌍 Pushing the boundaries of **open AI**!
🐋 2/n
DeepSeek-Coder-V2: First Open Source Model Beats GPT4-Turbo in Coding and Math
> Excels in coding and math, beating GPT4-Turbo, Claude3-Opus, Gemini-1.5Pro, Codestral.
> Supports 338 programming languages and 128K context length.
> Fully open-sourced with two sizes: 230B (also
🎉 DeepSeek-VL2 is here! Our next-gen vision-language model enters the MoE era.
🤖 DeepSeek-MoE arch + dynamic image tiling
⚡ 3B/16B/27B sizes for flexible use
🏆 Outstanding performance across all benchmarks
🧵 1/n
💰 API Pricing Update
🎉 Until Feb 8: same as V2!
🤯 From Feb 8 onwards:
Input: $0.27/million tokens ($0.07/million tokens with cache hits)
Output: $1.10/million tokens
🔥 Still the best value in the market!
🐋 3/n
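As a quick sanity check on the rates in the pricing post above, here is a back-of-the-envelope cost estimate. The function name, variable names, and example request sizes are ours, not from the post:

```python
# Illustrative cost calculator for the post-Feb-8 rates quoted above.
# Prices are USD per million tokens.
INPUT_PRICE = 0.27         # input, cache miss
INPUT_CACHED_PRICE = 0.07  # input, cache hit
OUTPUT_PRICE = 1.10        # output

def request_cost(input_tokens, output_tokens, cached_tokens=0):
    """Estimate the cost of one API call in USD."""
    uncached = input_tokens - cached_tokens
    return (uncached * INPUT_PRICE
            + cached_tokens * INPUT_CACHED_PRICE
            + output_tokens * OUTPUT_PRICE) / 1_000_000

# e.g. a 100K-token prompt, 90% served from cache, with a 1K-token answer:
print(round(request_cost(100_000, 1_000, cached_tokens=90_000), 6))
```

At these rates the hypothetical request above costs about a cent, which is the point of the "best value" claim.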
🚀 Launching DeepSeek-V2: The Cutting-Edge Open-Source MoE Model!
🌟 Highlights:
> Places top 3 in AlignBench, surpassing GPT-4 and close to GPT-4-Turbo.
> Ranks top-tier in MT-Bench, rivaling LLaMA3-70B and outperforming Mixtral 8x22B.
> Specializes in math, code and reasoning.
🚀 DeepSeekMath: Approaching Mathematical Reasoning Capability of GPT-4 with a 7B Model.
Highlights:
- Continue pre-training DeepSeek-Coder-Base-v1.5 7B with 120B math tokens from Common Crawl.
- Introduce GRPO, a variant of PPO, that enhances mathematical reasoning and reduces
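The core idea of the GRPO variant mentioned above, as described in the DeepSeekMath paper, is to replace PPO's learned value-network baseline with a group-relative one: sample several outputs per prompt and normalize each reward by the group's statistics. A minimal sketch of just that advantage computation, with our own function and variable names:

```python
import statistics

def group_relative_advantages(rewards):
    """GRPO-style advantages: normalize each sampled output's reward by the
    mean and std of its group (all samples drawn for the same prompt).
    This replaces PPO's learned value baseline; names here are ours."""
    mean = statistics.mean(rewards)
    std = statistics.pstdev(rewards) or 1.0  # guard against zero spread
    return [(r - mean) / std for r in rewards]

# Four sampled answers to one math prompt, scored 0/1 for correctness:
print(group_relative_advantages([1.0, 0.0, 0.0, 1.0]))
```

Because the baseline comes from the group itself, no separate value network needs to be trained, which is where the memory savings the post alludes to come from.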
🌌 Open-source spirit + Longtermism to inclusive AGI
🌟 DeepSeek’s mission is unwavering. We’re thrilled to share our progress with the community and see the gap between open and closed models narrowing.
🚀 This is just the beginning! Look forward to multimodal support and
DeepSeek has not issued any cryptocurrency. Currently, there is only one official account on the Twitter platform. We will not contact anyone through other accounts. Please stay vigilant and guard against potential scams.
🚀 Introducing Janus: a revolutionary autoregressive framework for multimodal AI! By decoupling visual encoding & unifying processing in a single transformer, it outperforms previous models in both understanding & generation.
⚡️ Powerful, simple, flexible, & next-gen ready! 🔥
🎉Exciting news! We open-sourced the DeepSeek-V2-0628 checkpoint, the No.1 open-source model on the LMSYS Chatbot Arena Leaderboard. Detailed Arena Ranking: Overall No.11, Hard Prompts No.3, Coding No.3, Longer Query No.4, Math No.7. DeepSeek-V2-0628 is released at
🚀 Exciting news! We’ve officially launched DeepSeek-V2.5 – a powerful combination of DeepSeek-V2-0628 and DeepSeek-Coder-V2-0724! Now, with enhanced writing, instruction-following, and human preference alignment, it’s available on Web and API. Enjoy seamless Function Calling,
🎉Exciting news! DeepSeek API now launches context caching on disk, with no code changes required! This new feature automatically caches frequently referenced contexts on distributed storage, slashing API costs by up to 90%. For a 128K prompt with high reference, the first token
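To make the caching savings concrete, here is a rough estimate using the discounted cache-hit input rate from the pricing post earlier on this page ($0.27/M full vs $0.07/M cached). These rates and all names are illustrative; actual savings depend on the live price sheet and your cache-hit ratio:

```python
def cached_input_cost(tokens, hit_ratio, full_price, cached_price):
    """Input cost in USD when a fraction of the prompt hits the cache.
    Prices are per million tokens; function and argument names are ours."""
    hit = tokens * hit_ratio
    miss = tokens - hit
    return (hit * cached_price + miss * full_price) / 1_000_000

# A 128K-token prompt: first call misses the cache, repeats fully hit it.
first = cached_input_cost(128_000, 0.0, 0.27, 0.07)
later = cached_input_cost(128_000, 1.0, 0.27, 0.07)
print(round(first, 5), round(later, 5), round(1 - later / first, 2))
```

At these example rates a fully cached repeat is about 74% cheaper on input; heavier discounts on the cached rate push the figure toward the "up to 90%" the post cites.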
📢 After 3 months, the AI Mathematical Olympiad (AIMO) on Kaggle has announced the winners! 🎉 We're thrilled to see the Top 4 teams all chose DeepSeekMath-7B as their base model, with Numina achieving 29/50 correct answers! 👏 Even Terence Tao was amazed. 🤯