Bringing Up DeepSeek-V4-Flash on AMD MI300X

62 points by kkm3 hours ago

4 comments

maCDzP59 minutes ago
I train on AMD MI250X and managed to get Gemma 4 31B to work - but it took a lot of work on the software side.
- kkm57 minutes ago
  This is very interesting, planning to write about it?
kkm2 hours ago
Also the vllm patch accompanying the blogpost: <a href="https://github.com/doublewordai/vllm-amd-blog-doubleword" rel="nofollow">https://github.com/doublewordai/vllm-amd-blog-doubleword</a>
mezark2 hours ago
We at doubleword are bullish for AMD for low-interactivity inference - it does just take a bigger lift on the software side...
- brcmthrowaway38 minutes ago
  Are you long AMD?
benlm2 hours ago
Nice work! Would DeepSeek V4 Pro on 8xMI300X work with these patches?