600 GB/s of memory bandwidth isn't anything to sneeze at.<p>~$1000 for the Pro B70, if Microcenter is to be believed:<p><a href="https://www.microcenter.com/product/709007/intel-arc-pro-b70-single-fan-32gb-gddr6-pcie-50-graphics-card" rel="nofollow">https://www.microcenter.com/product/709007/intel-arc-pro-b70...</a><p><a href="https://www.microcenter.com/product/708790/asrock-intel-arc-pro-b70-creator-32gb-gddr6-pcie-50-graphics-card" rel="nofollow">https://www.microcenter.com/product/708790/asrock-intel-arc-...</a>
Recent kernels have SR-IOV support for these chips too. B&H has them listed for $950.<p><a href="https://www.bhphotovideo.com/c/product/1959142-REG/intel_33p01ib0bb_arc_pro_b70_32gb.html" rel="nofollow">https://www.bhphotovideo.com/c/product/1959142-REG/intel_33p...</a><p>When 32GB NVIDIA cards seem to start at around $4000 that's a big enough gap to be motivating for a bunch of applications.
I think the B65 is priced at $650. Both supported by llamacpp I believe. With that power draw you could run two of them.
I tend to agree that the vram size and bandwidth is the core thing, but this B70 Pro allegedly has 387 int8 tops vs a 5090 having 3400 int8 tops. 600 compares vs 1792GB/s. I'm delighted so see an option with quarter the price! But man, a tenth the performance?
<a href="https://www.techpowerup.com/347721/sparkle-announces-intel-arc-pro-b70-and-intel-arc-pro-b65-series-graphics-cards" rel="nofollow">https://www.techpowerup.com/347721/sparkle-announces-intel-a...</a>
<a href="https://www.tomshardware.com/pc-components/gpus/nvidia-announces-rtx-50-series-at-up-to-usd1-999" rel="nofollow">https://www.tomshardware.com/pc-components/gpus/nvidia-annou...</a>
Intel GPU prices have stayed fine, but I do wonder if they are viable for Inference if they will wind up like Nvidia GPUs, severely overpriced.
Where's the A310 / A40 successor? Gimme some SR-IOV in a slot-powered, single-width, low-profile card.
New cards in 2026, and targeting Vulkan 1.3?!
Both have 32gb vram. Could be a pretty compelling choice.
Anyone running an ARC card for desktop Linux who can comment on the experience? I've had smooth sailing with AMD GPU's but have never tried Intel.
Running dual Pro B60 on Debian stable mostly for AI coding.<p>I was initially confused what packages were needed (backports kernel + ubuntu kobuk team ppa worksforme). After getting that right I'm now running vllm mostly without issues (though I don't run it 24/7).<p>At first had major issues with model quality but the vllm xpu guys fixed it fast.<p>Software capability not as good as nvidia yet (i.e. no fp8 kv cache support last I checked) but with this price difference I don't care. I can basically run a small fp8 local model with almost 100k token context and that's what I wanted.
Ive ran arc on fedora for years and for general desktop use it’s been perfect. For llm’s/coding it’s getting better but it’s rough around the edges. Had a bug where trying to get vram usage through pytorch would crash the system, ect.
There was the video a little while back where LTT built a computer for Linus Torvalds and they put an Intel Arc card inside, so I'd imagine Linux support is at the very least, acceptable.<p>[1] <a href="https://www.youtube.com/watch?v=mfv0V1SxbNA" rel="nofollow">https://www.youtube.com/watch?v=mfv0V1SxbNA</a>
Since they fired the entire Arc team and a lot of the senior engineers already updated their Linkedins to reflect their new positions at AMD, Nvidia, and others, as well as laying off most of their Linux driver team (GPU and non-GPU), uh...<p>WTF?
You are exaggerating, right? They didn't really fire the entire Arc team did they? I couldn't find a source saying that.
Nope, no exaggeration.<p>The news that Celestial is basically canceled already hit the HN front page, as well as Druid has been canceled before tapeout.<p>Celestial will only be issued in the variant that comes in budget/industrial embedded Intel platforms that have a combined IO+GPU tile, but the performance big boy desktop/laptop parts that have a dedicated graphics tile will ship an Nvidia-produced tile.<p>There will be no Celestial DGPU variant, nor dedicated tile variant. Drivers will be ceasing support for DGPUs of all flavors, and no new bug fixes will happen for B series GPUs (as there is no B series IGPUs; A series IGPUs will remain unaffected).<p>They signed the deal like 2-3 months ago to cancel GPUs in favor of Nvidia. The other end of this deal is the Nvidia SBCs in the future will be shipping as big-boy variants with Xeon CPUs, Rubin (replacing Blackwell) for the GPU, Vera (replacing Grace) for the on-SBC GPU babysitter, and newest gen Xeons to do the non-inference tasks that Grace can't handle.<p>There is also talk that this deal may lead to Nvidia moving to Intel Foundry, away from TSMC. There is also talk that Nvidia may just buy Intel entirely.<p>For further information, see Moore's Law Is Dead's coverage off and on over the past year.
This is a chip they've had lying around for a while. It's the same architecture as used in the Arc B580 that launched at the end of 2024; this is just a slightly larger sibling. Intel clearly knew that their larger part wouldn't make for a competitive gaming GPU (hence the lack of a consumer counterpart to these cards), but must have decided that a relatively cheap workstation card with 32GB might be able to make some money.
Wake me when they wake up and release a middling card with 128GB memory.
Buy 4?
Which mainboards are cheap and have 4 pcie16x (electrical) slots, that don't need weird risers to fit 4 GPUs
Because I don't want to spend $4k.<p>I want to spend $1500 for a card that can run a proper large model, even if it only can do 25 tk/s.<p>Intel is squandering a golden opportunity to knee-cap AMD and Nvdia, under the totally delusional pretense that intel enterprise cards still have a fighting chance.
Not sure why you'd want this over an apple setup. M4 max is 545GB/s of memory bandwidth - $2k for an entire Mac Studio with 48GB of RAM vs 32 for the B70.
Being able to keep infrastructure on Linux is a big advantage.
My thinking is that I'd pick this, because I can't just plug a Mac into a slot in my server and have it easily integrate with all my other hardware across an ultra fast bus.<p>If they made an M4 on a card that supported all the same standards and was price competitive, though, that might be a good option.
with those $2k you can have 2xB70, with 1.2Tb/sec and 64G Vram, on linux ( and you can scale further while mac prices increase are not linear 0
Support for Single Root IO Virtualization (SR-IOV) to enable compute and Graphics workloads in virtualized environments.
one can upgrade and swap parts with a computer running an Intel GPU. Linux is very well supported compared to Mac hardware.
Funny, I not sure why anyone would use Apple over Linux.