I keep seeing these "sovereign" LMs time and time again. In Sweden we had GPT-SW3 (<a href="https://www.ai.se/en/project/gpt-sw3" rel="nofollow">https://www.ai.se/en/project/gpt-sw3</a>) and same story there. Instead of burning money on "sovereign" claims, national research labs should instead focus on building on top of solid baselines (like Qwen/Kimi) and finetuning frontier models with real agentic utility that can be applied across actual use cases and can be widely used by its people, basically for free. Nations should mirror what Cursor has done with Composer 2.5 for example.
Disagree, it’s in the country’s best interest to facilitate internal expertise on the full stack and own their “supply chain” so to speak and fight brain drain. The outcome isn’t just the model, it’s the expertise. Otherwise all their smartest folks will depart for countries where LLM development is strongest.
Got it, so every country should focus on having a mediocre-at-best AI strategy by refusing to work together? Surely this will create a better future instead of pooling resources.<p>This vaguely-nationalist world view around tech that’s emerging in Europe is dangerous, man.<p>On the brain drain problem in particular, one way to ensure talent sticks around is to create a good environment for people to do their best work. In much of Europe, getting bureaucracy out of the way and encouraging real investment would go a long way. People leave because they can make more money and they want to be surrounded by the best people. People would trade some of that off to stick around their home countries, however if you go to California and talk to folks from e.g. NL or DE working on this stuff, they have a lot to say about innovation and working culture back home.
This "vaguely nationalistic world view around tech" is a direct consequence of the US government weaponizing its leading tech firms reach into the EU for nationalistic purposes.<p>The EU was build on the principles of collaboration, to overcome the nationalistic impulses. Free trade and free movement of people, no need for everyone to replicate what everyone else is doing. Preventing individual nation states from favoring and supporting their home grown firms over other EU firms is a central legal principle.<p>But this only works if it is reciprocal. And ideally if the partners are roughly on the same level. When you force developing nations in trade deals to not protect local firms, you are also preventing them from moving up the value chain and locking them into the position of "raw material providers".<p>When trading with China we know that China has absolutely no qualms supporting its strategic industries with a truck load of subsidies. And it is preventing foreign firms from investing and selling on the domestic market.<p>For the longest time the US was considered a safe partner though. Sure, plenty of disagreements in the details, but in principle someone whom you can rely on. That idea has been decisevly dismantled over the last 10ish years. The US unilaterally cut of the EMails of EU courts. The US has unilaterally decided that EU partners cannot use Fable/Mythos.<p>The only reasonable reaction is to make sure that the EU can maintain and create its own critical infrastructure.
>This "vaguely nationalistic world view around tech" is a direct consequence of the US government weaponizing its leading tech firms reach into the EU for nationalistic purposes.<p>And in response NL should weaponize ASML. Then both sides would naturally back down. Specialization is the most efficient way of developing tech civilization.<p>Whereis everybody building their own mediocre versions would be repeating Russia in its attempt to make their own national messenger - a lot of money sunk, yet people are still using Telegram/etc.
> This vaguely-nationalist world view around tech that’s emerging in Europe is dangerous, man.<p>It's a direct response to the MAGA / America First attitude of the American electorate.
Hopefully you understand that there's an obvious selection bias you encounter when talking to folks from NL or DE in California. Ofc the people that moved will think there's a good reason for them moving. But people who live here have good reasons for staying here.
Why would it have to be mediocre?<p>Both Netherlands and Sweden produce highly competent researchers. Per capita on par with any other location on the planet, including California.<p>This will be good for Europe as a whole and also the planet.
>and also the planet.<p><i>What a glorious day for Canada and, therefore, the world.</i>
You missed the part where they got 13 million EUR , 3 years ago! and dont have a release yet....Definitely doing it at European pace...they will have release two when AGI is around...
Producing local researchers isn’t enough. If you want a competitive model you need to bring the best researchers from all over the world. That means lots of investment that hasn’t been common in Europe.
Sure, but there is still a more dense talent pool with SV companies, and training frontier models requires massive amounts of capital. Are Netherlands and Sweden prepared to invest 10s of billions?
My impression is the talent pool for core model development in SV isn't huge and is already grabbed by the usual suspects. Everyone else sloshing in the vague broader AIsphere are application bros whose value in the supply chain is minimal.
Honestly I have a strong suspicion that “the talent pool” will shift towards Europe and China and I believe it’s already happening.<p>And yes, I do believe Europe will invest in this technology.
> This vaguely-nationalist world view around tech that’s emerging in Europe is dangerous, man.<p>It’s a response to the actually-nationalist practice of the United States. I can understand why it might feel different from California, but things are a bit scary over here right now.
Collectively Europe has talent and money to support it. Whole point of sovreignity around it is to have inhouse the talent and capacity to have essential resources even if it can be have for cheap from other nations/alliances.<p>That is why people lament at idea of being completely dependent on Russian gas, US tech or Chinese manufacturing.
> pooling resources<p>Your argument would suggest the EU developing a European model would be a better direction. A heavy-weight competitor would help advance the field after all.<p>> getting bureaucracy out of the way and encouraging real investment<p>I don't think this is really about regulation - it's about network effects. The only way to compete with strong network-effects is to create your own.<p>> This vaguely-nationalist world view around tech<p>Nationalism breeds nationalism and it is the fundamental reason European states feel the need to build their own expertise. Can you imagine if your country was subject to the whims of an aspiring dictator?
> This vaguely-nationalist world view around tech that’s emerging in Europe is dangerous, man.<p>Guess which country blocked access to a SOTA model based on national security bullshit.
Tech sovereignty is a response to American imperialism. It has not gone unnoticed that the White House has summoned the tech bros and weaponised tech.
Great, now they will get expertise and then get hired by openai and move
There is something to be said for this "most cheapest" approach, there is also something to be said for making models that are entirely ethically sourced:<p>1. Free of controversy like unlicensed training materials<p>2. Free of exploitative rlfh loops by people in low-wages countries<p>3. The leasons learned (and published) from going through the entire training process on "European" hardware: "AI factories" (the term for Slurm HPC/HTC systems with lots of heavy GPU nodes, heavily subsidized by our government [0])<p>1 and 2 are strong counter-LLM arguments at the moment, and hold back some groups of potential users. Another is energy/water use, so going for maximum green energy would be a nice boon as well. 3 is something I consider to be highly useful for our European identity and "way of the ninja" (for you Naruto fans out there).<p>[0 <a href="https://hpc-portal.eu/funding-opportunities" rel="nofollow">https://hpc-portal.eu/funding-opportunities</a>]
If open frontier models start closing up and states start more export controls on AI services and hardware, it might be good to ensure the supply chain is there to reproduce the SotA, or even a couple generations behind it.
Sounds like calling those model "open source" did its wicked job. You can not take an open weight model and build a next-generation model using that as a foundation. Once those companies decide it's no longer in their interest to release new open weights everything you've created this way becomes a pile of rapidly deprecating legacy.
Do we know for sure how much national corpus of knowledge (like dutch) goes into these "global" models and how that affects "localized" model biases? What's wrong with specialized models?
Today, you keep seeing "sovereign" LMs that are <i>subject</i> to the sovereignty of some human-led state. Tomorrow, the "sovereign" LMs will be called that for a completely different reason.
Their legality is very questionable given all the likely copyright infringement going on, and a state can't really ignore that.
States are the things which <i>can</i> ignore that, and I'm pretty sure US and China already do. No state is going to respect copyright if they think its future is at stake, and apparently even Netherlands thinks the future is at stake.<p>(Of course states can ignore copyright in a legally polite manner, such as asserting that training on all published material in the National Library is fair game)
And what happens once the "solid baselines" become unavailable for a reason or the other?
You keep building on the last available version? Fine tuning is a whole lot cheaper, easier and more useful than pretraining a model from scratch. It's a complete no brainer.
> You keep building on the last available version?<p>yes but a sovereign can allocate some resources and a few people to stay in the loop from a first principles level. No need to wait for a rug pull.<p>Of course, it can not compete with the frontier labs. But good to have researchers and professors "in-house". LLMs are here for the long-term.
Unfortunately in this game first principles requires massive resources, not "some". Building in-house on top of existing open weights is a good way to bootstrap this process, especially since there's nothing inherently magical or particularly expertise-heavy when it comes to weights themselves
Seems like you don’t understand.<p>You take current version and build on top of it. You have the weights.<p>You might not get some n+1 version at some point but the n version you will have will be still most likely much better than whatever you come up with burning good will money of people believing in „sovereignty”.<p>You are not getting ahead in this game by being „true to your local values” capital expenditure is insane in this game.
Kimi and Qwen come out of China, which means that their training material may be biased e.g. relating to Taiwan [1]. In addition, there is no way to determine what input went into the training, if it was properly licensed, if it was legal (e.g. not contaminated by CSAM), or how the human component of RLHF was sourced - in US models, for example, stories about exploitation like [2] have been floating for years.<p>Assuming us Europeans finally get our act together, I think it is better for our long-term future (and the ethical problems) if we manage to get a baseline of training input and data ourselves, from scratch, with everything being ethically sourced.<p>Oh and, while we're at it, the EU has 24 official languages plus a host of minority languages. Most LLMs focus on the English, German, French and Chinese languages, but everything else is... left behind at best. An European model with actual funding and proper data sources might be able to significantly reduce that.<p>[1] <a href="https://www.taiwannews.com.tw/news/6245677" rel="nofollow">https://www.taiwannews.com.tw/news/6245677</a><p>[2] <a href="https://www.theguardian.com/technology/2024/apr/16/techscape-ai-gadgest-humane-ai-pin-chatgpt" rel="nofollow">https://www.theguardian.com/technology/2024/apr/16/techscape...</a>
The Chinese models are almost certainly taught to comply with "Chinese values" in the RLHF step, not from filtering the training data. There may be a few things which are too radioactive to be allowed even in the training material - but that's more likely to be things like child abuse images for a visual model, things non-Chinese values also have an issue with.<p>I'm pretty sure no county taking a stab at making their own model for sovereignty purposes will let "proper licensing" stand in their way.
This already exists <a href="https://eurollm.io/" rel="nofollow">https://eurollm.io/</a><p>How do people not know about it and keep making stuff from scratch?
> Most LLMs focus on the English, German, French and Chinese languages, but everything else is... left behind at best<p>Current frontier models (closed and open) are already really good at small languages too. I use them in Finnish sometimes, and the language is immaculate. They underestand even somewhat obscure dialects. Multilinguality seems to be a mostly solved problem.
It really doesn't matter if the model sucks and doesn't perform well. Given the funding amount and their lofty ambitions, it seems very unlikely they will be able to pull it off properly.<p>Yeah China and US models have baises but so will any model. The biases do not get in the way of the product though. You don't open those models just to ask for what happened in Taianaman square or if Taiwan is a state. You dont ask ChatGPT to generate CASM. But they are very good at the tasks you actually expect from a LLM. If you fail at that, nobody will use your model no matter how "ethically sourced" a colonizer-based entity like Europe made it.
> no matter how "ethically sourced" a colonizer-based entity like Europe made it<p>The attempt is laughable, buy every country should at least try to keep up with frontier technology, even if they fail massively or are massively underfunded.<p>On the other hand, it's arguably wasteful for an incompetent govt to do something like this, since the money will almost certainly not be well spent. It will just go to people good with MS Word. That's the likely failure mode for such NL innovation projects. The actual solution is a culture shift, but that is much harder if not impossible to pull off and requires decades. But we (NL people and govt) should work towards that. Most likely all these govt led innovation attempts are a sad waste of tax money.
The culture shift that has generated this is the same one that causes the other story on HN this morning about xAIs gas generators being a national security issue. Ie one towards corruption graft and the public ill.<p>I don’t want Europe to model itself on the US, whatever the economic gain. Hopefully we are large enough to find a third way between China and the US.
> Most LLMs focus on the English, German, French and Chinese languages, but everything else is... left behind at best.<p>that is not true, so please read before make an opinion. The French Mistral project shipped seven+ years ago with 140 languages for example.. language translation was the first LLM task from 2015
There is something north of 8% OCR error rates.. that will hurt model quality!
Uh, some would say it's easy to determine what input went into the training for kimi and qwen.. since they were caught stealing it from American labs. Some cultural cliches may never change.
It's well-known that all commercial models are based on stolen content. That doesn't mean there is no filtering/censoring, just that the censoring likely depends on where it's happening…
> <i>since they were caught stealing it from American labs. Some cultural cliches may never change.</i><p>Has a formal lawsuit been brought to bear? Given, Anthropic & OpenAI are being dragged through courts for copyright violation (or stealing, as you'd call it, if the companies involved were culturally Chinese) by newspapers, publishing houses etc; one'd think they'd pass on some of that medicine to Alibaba, which does have business entities registered in the US.
> since they were caught stealing it from American labs<p>...and "good guys" the American labs were caught stealing from authors all over the world[1].<p>[1]: www.npr.org/2025/09/05/g-s1-87367/anthropic-authors-settlement-pirated-chatbot-training-material
<i>.... Anthropic began buying books in bulk, tearing off the bindings and scanning each page before feeding the digitized versions into its AI model, according to court documents.</i><p>Wow. This image of Anthropic employees ripping books apart to use them to train models is a powerful one, seems like an inflection point in the history of information.
>> Some cultural cliches may never change.<p>Let’s just gloss over the monstrous amount of copyrighted and pirated material the American labs trained on. China bad. American good. Some cultural cliches never change.
It is crazy that anything Europe gets so much hate. IMO it is important to build models within the boundaries of smaller nations, using their own language. Research has to continue even if it is outside of US and China.
I was somewhat excited about these "sovereign" open models in the beginning, but it became soon apparent that they're not gonna be anything but toys compared to SOTA.<p>The problem is that there are a lot, at least 30, of these small projects scattered around, funded for a few years as some ad-hoc temporary coalition of universities and businesses. Those simply cannot compete with businesses spending tens of billions on developing these. Especially when you have to bring a spoon to a gunfight restricting to "clean" data.<p>Multilinguality is essentially a solved problem, and restricting too much on one language with more limited resources is gonna make the model worse in that language too.
This platform is running from the US and frequently accessed by US people. What else would you expect?
It’s not that it gets hate so much as it’s akin to watching them make announcements that they’re going to make a European google/facebook/tiktok.<p>Sure… they can, except at the end of the day it’s a bit late, regulatory burden will make it comparatively useless, and because of that nobody will ever use it. It will be spending a bunch of taxpayer dollars for press releases.<p>The running joke is that when these “sovereign” EU models launch, they’re going to refuse to answer anything that might involve personal information such as Elon Musk’s birthday.
I guess if you are strict about it, making derogatory comments like yours is indeed not hate. But I'm sure you are aware that "getting hate" frequently used in a more extensive meaning online, especially in the context of replies to a post and I don't see the much point in insisting on the stricter definition here.
At least with social networks the network effect is a powerful force. Foregrounding regulatory burden in that context is nonsensical. (That does not apply in the same way to models.)
That’s on Wikipedia, it’s not PII, it’s also not going to be relevant to any meaningful IRL work.<p>I challenge the assumption you can do meaningful work in this field without blatant disregard for intellectual property.<p>The idea that it’s all down to training size is clearly incorrect, as every expert human learned their craft without nearly the sum total information of the internet. Clearly there are architectural wins to be found.<p>Besides that, why would everyone just be fine with Opus level AI at best, as that’s all the US is willing to export, and I doubt China will share beyond that.<p>Sovereign AI is more important than ever after Friday.
I kinda agree, the best use of taxpayer money should be in reducing taxes to corporation that would like to compete in the market vs US and China, rather than making governments playing the game (since they very obviously can’t).
If a teenager on your street said he was going to spend $1,000 to customize his Honda Civic for his needs, you'd believe him. If he says he's going to build a brand new car, better than a Honda civic, for $10,000, you'd laugh and say good luck.
If Europe is serious about getting home grown AI fast, three simple steps:<p>1. Huge tax incentives, let the companies get grossly wealthy while paying minimal taxes. Minimum 10 years with clauses protecting "retribution" taxes there after.<p>2. Tax incentives for the founders/shareholders, just like above.<p>3. Drop worker protections to a minimum, make it easy to fire people. You only want serious/dedicated employees anyway.<p>Within 2-3 years there will be at least a trillion dollars looking to get in.<p>Don't worry though if reading that made you mad. Its absolutely not going to happen. I can think of few things more antithetical to the European ethos than smart skilled people working 80-100hrs weeks with almost no vacation to gas their founders net worth by tens, hundreds, of billions.
Some additional points to consider:<p>1. Pay the workers in company scrip and relocate the workers to a company town. That way, all workers are fully dedicated to the company.<p>2. Start importing slaves from Africa again. It worked to build up massive wealth. Should do the trick for AI as well.<p>3. Abolish the 8 hour work day. No comment needed.<p>With these 3 simple tricks, you too can get 6-7 bazillion euro AI mammoths.
This "greed is good, and should be rewarded" philosophy is one I see all too often on HN, and the entire reason why Americas political, regulatory, and business leadership have been overrun by the countries most criminally corrupt narcissists and psychopaths; why its democracy has collapsed.<p>When you reward the most selfish, corrupt, and antisocial behaviours with wealth and power, you're guaranteed to create a selfish, corrupt, and antisocial society. IMHO it's indicative of what I have dubbed Americas "mental illness epidemic"; specifically cluster B personality disorders [0] which are characterised by socially-destructive and self-destructive behaviours.<p>If that's the world you want for you and your loved ones, congratulations. You've earned it!<p>[0] <a href="https://wikipedia.org/wiki/Classification_of_personality_disorders#Cluster_B" rel="nofollow">https://wikipedia.org/wiki/Classification_of_personality_dis...</a>
If tax incentives to attract companies and at-will employment are already viewed as a destructive collapse of democracy, it's no wonder we are not getting anything done here in Europe.
3 Times As Many Europeans Move to the US, than the Other Way Around.<p>[1]: <a href="https://mises.org/mises-wire/3-times-many-europeans-move-us-other-way-around" rel="nofollow">https://mises.org/mises-wire/3-times-many-europeans-move-us-...</a>
Butcher worker protections and quality of life across all industries to (hypothethically) benefit a single one?<p>No thanks.<p>Why do you feel grinding insane hours would be beneficial to AI progress?
Exactly this. You can't have a competitive industry while at the same time heavily redistributing wealth to the point where people don't have any incentives at all.
They can be serious about home-grown AI without needing to become a libertarian capitalist hellscape. I prefer happiness, safety and privacy over competing with the US / China.
Europe should have a sovereign model on its content and languages that is trained with renewable energy and published as open source.<p>This looks like a good step in that direction.
> GPT‑NL is developed within the Netherlands and Europe. This gives us full control over the model, the data and the choices we make. We avoid dependency on non‑European providers and invest in a sustainable AI ecosystem aligned with our laws, values and societal goals.<p>I love it! So this is our answer to America and China denying foreigners access to their frontier models.. a massive 13,5M€ founding to develop souvereign european ai, trained exclusively on legally obtained documents and highest moral standards as defined in EU AI Act.
So good to see these developments. Every country should do this. I'd even say every <i>person</i> should gave their own personalized AI running on their own computers. If only the costs involved were not so astronomical.
Why? That doesn't make any sense.<p>The government would be far better off figuring out how to take commodity models and applying them to government functions where they can, with deterministic scaffolding and guardrails, to make government more efficient, optionally using RL on traces from their use to improve their performance.<p>Imagine taking models and fine-tuning them / doing RL rollouts to help automate permit application approvals, as applied specifically to Dutch permit processes. That would be a real help to Dutch businesses!<p>That type of applied AI is more interesting and effective now than just trying to make another foundational model that isn't going to work well or do anything of economic value.
Another thing they could do is try to attack the the suppply chain issues. Try to form an alliance to block RAM deals or something, or to get fabs on EU soil, making HBM for the people. We have some bargaining chips, especially when banding together with a few large EU states. Not as EU, just a few specic countries. No bureaucracy, just elite trade diplimacy. Probably best done in secret so the big labs don't catch wind of this. Any NL/DE/UK/FR/CH/PL/IT govt people reading this?
> Why?<p>Because then the USA can't just turn it off.
I think it will be cost effective at some point. Computers were limited to research institutes before the personal computer arrived.
"Champions of a European AI model should ask themselves if a European effort would be more effective than Meta, which this year will spend more on chips ($125 billion) than Germany spends on defense ($114 billion) and offer salaries of over $100 million to attract the best researchers, and is still failing to catch up. Elon Musk tried and failed to build a good AI model."<p><a href="https://www.siliconcontinent.com/p/nineteen-thoughts-on-ai-and-europe?hide_intro_popup=true" rel="nofollow">https://www.siliconcontinent.com/p/nineteen-thoughts-on-ai-a...</a>
Nvidia will certainly be pleased.
Interesting that this got posted now: the project is receiving increasingly more skepticism lately in the Dutch tech scene [0], and I think that’s fully justified.<p>[0]: <a href="https://www.quotenet.nl/zakelijk/a71588202/techondernemers-maken-nederlandse-chatgpt-belachelijk-kan-echt-helemaal-niks/" rel="nofollow">https://www.quotenet.nl/zakelijk/a71588202/techondernemers-m...</a>
What is the exact skepticism? The only thing I could get from that was from some "tech entrepreneur":<p>> GPT-NL was never built to compete with Claude or ChatGPT. It was trained exclusively on licensed data, and is intended more for governments and companies where privacy and compliance matter more than raw performance.”<p>That's it? That it didn't aim to compete with SOTA models? Maybe this is something you have to start with <i>something</i>, then ramp up, rather do what only a select few labs been able to do, start with really big models. Especially if you're resource constrained, which since this is a government project, I really hope for the sake of the tax payers it was.
I don't understand countries (especially governments) wanting to have their own models when there are already pretty solid open source (weights) models out there.<p>Countries should want control over _where_ the compute is happening rather than _what code_ is running.<p>What's wrong with a country hosting a Kimi, Qwen or GPT-Oss on their hardware for their government work purpose?
There's an absolutely massive cultural and behavioural bias in those models. Models will suggest things like "go to the hospital" for things that require GP appointments, "just drive three hours" while it's faster to go places by train, and so on. They will do it in anglicised Dutch (compound words split, English-like grammar structures) that's perfectly understandable, but the cultural bias is there if you know to look for it.<p>Furthermore, the expertise in designing and training these models is valuable as well. The existing models are good as a starting point in terms of learning from previous mistakes, but we should not just let a handful of American and Chinese people keep the knowledge and expertise.<p>One problem with this particular project, though, is that copyright has been enforced for Dutch LLM training before, and the AI industry cannot exist without massive scale piracy, the likes of which has never been seen before. A lot of Dutch training material exists in pirated books that AI companies in countries that do not care about copyright have access to, but are exempted from the training set here. The impact of enforcing copyright on an AI model will be quite interesting to see.
It is not about the country but the language. Most llms have poor or no support for Dutch.
Idk which models you refer to, but I tested a bunch recently, and they performed well on Dutch. Only the smallest, such as qwen 3.6 27B, made up words and switched languages.
There's a large gap between making up words and an actually native text distribution. LLMs have a clear pattern, clear tells, a "feel" in English, and it's normally even more pronounced in non-English languages.<p>Lots of bias towards English sentence structure, idioms, etiquette, etc.
There would be a bunch of value in having, say, a good 30B-class model that used my local language as well as it does English. There's lots of cases, especially in the government sphere, where local processing is a requirement and frontier-level capabilities aren't required. Making those cheap to run seems like a fine goal.
I don't understand this. Even if that were true (and it isn't in my experience), a model that is trained on a Dutch corpus and arguably "knows Dutch well" but has the reasoning and comprehension abilities of a three year old is useless in any case. I'd rather use a model that can only speak English and put an automatic translator around it.
An LLM is an encoding of a culture, a way of viewing the world.<p>They are not neutral technology, they are a direct representation of the training set that has been chosen and how they are aligned.<p>In many ways, they are ideology made code.<p>If we leave building them to the US and China, only their way of seeing things will be digitized.<p>I don't like the idea of that.
Yes and also, US and Chinese models are censored in different ways. US models are way too prudish for personal use in Europe because they're afraid to piss off religious investors. Chinese models are too censored on history and current affairs, eg the tiananmen massacre never happened stuff like that.
Chinese models aren't censored as much as you think, you can download the model and run it somewhere else and they will happily tell you about Tiananmen Square. Or heck, ask DeepSeek via Openrouter, it will do the same.<p>The censorship works kind of like with Fabel, it kicks in before the model responds.
Really? Because I'm pretty sure that at least every two days there's a active post with a top voted comment along the lines of "The EU isn't doing AI themselves, they are so hosed".
Why should Dutch people be expected to make do with models 99% trained on American/Chinese cultural context and language?
Maybe the Dutch really really want an LLM that tells them the truth as straight as possible no matter how harsh - that might be tricky
Understood, but they could fine tune base models on their own cultural context and language. Why reinventing the wheel?
I thought finetuning data can't contradict foundation models, and anything that are inconsistent with the standard LLM American-Chinese split personality would be rejected?
They could apply the Polder Model of consensus decision making with a mixture of experts.<p><a href="https://en.wikipedia.org/wiki/Polder_model" rel="nofollow">https://en.wikipedia.org/wiki/Polder_model</a>
Funny, that's what I thought when PewDiePie set up his monster AI rig and what he called a 'council'. Quote:<p>"PewDiePie has built a custom web UI for self-hosting AI models called "ChatOS" that runs on his custom PC with 2x RTX 4000 Ada cards, along with 8x modded RTX 4090s with 48 GB of VRAM. Running open-source models from Baidu and OpenAI, PewDiePie made a "council" of bots that voted on the best responses, and then built "The Swarm" for data collection that will become the foundation of his own model coming next month."<p><a href="https://www.tomshardware.com/tech-industry/artificial-intelligence/pewdiepie-goes-all-in-on-self-hosting-ai-using-modded-gpus-with-plans-to-build-own-model-soon-youtuber-pits-multiple-sentient-chatbots-against-each-other-to-find-the-best-answers" rel="nofollow">https://www.tomshardware.com/tech-industry/artificial-intell...</a>
This gets better short-term results for a fraction of the cost, for sure, but what do you when China places an export control banning the release of open weight models? If you don't have your own talent, you're then relegated to using a base model from 2026 or whatever the cutoff date is, forever. That defeats the purpose of a 'sovereign' model made for and by your people.
Oh, it's all fine with cultural context here -- we don't even dub English language movies here because we are that cheap
[flagged]
<i>>Countries should want control over _where_ the compute is happening </i><p>Yeah but Europe doesn't build any computer hardware, and EU Green eco-communists and NIMBVYs don't want to have data centers built in their backyard, so the only way left for EU consultancies to milk taxpayer money for the AI bubble, is shipping a sovereign AI model for each country/language.<p>Watch out US tech sector, we're coming for you. Feel our wrath.
Have you heard of ASML? NXP?<p>Ignorant comment
Please don't move the goalposts. What computer parts does ASML or NXP make?<p>ASML only makes the lithography machines, 85% of which go outside the EU (let that sink in). And then fabs in Taiwan, Korea or the US use those ASML machines to etch US IP for computer chips. EU doesn't make any computer parts domestically.<p>And NXP mostly makes various microcontrollers and small chips, not high margin IP decenter centric parts like ASICS, FPGAs, CPUs or GPUs.<p>So not only are you the ignorant one here, but you also have the audacity to insult others with so much confidence.<p>@dwa3592 below. Firstly, why are you moving the goalposts in bad faith again just to stir an argument? What does that have to do with my original comment?<p>And secondly, there's other lithography machines out there, not just ASML.<p>And thirdly, the IP Nvidia, AMD, etc develop to etch on silicone via ASML machines makes them more valuable than ASML.<p>Fourthly, repeating my "let that sink in" phrase is just childish and low-IQ trolling, unworthy of this platform.
Europe is currently hosed because we made the mistake of trying to develop economies complementary to the US and china.<p>That was a big strategic mistake. In the US case it was borne of the mistaken belief that we shared values and were partners.<p>But don’t mistake the situation for lack of innovation of capability. Europe is currently adapting, but I think the success of Ukraine is one reason to be optimistic that current adversity might actually leave us better off in the long run.<p>Corrupt countries with broken legal systems tend not to fare that well in the longer run.
In a recent podcast, it was summarized as:<p>ASM (International) makes machines that add material to a silicon wafer (deposition).<p>ASML makes machines that remove material from said wafers (lithography, etching)<p>(I was a bit surprised that's not combined in 1 machine. But let's move on)<p>Then Besi makes machines to stack / interconnect / package those ICs into a package. I'm assuming pick & place machines are other companies' turf.<p>The above are all Dutch companies, operating a pretty important section of the tech stack.<p>Iirc there were (& probably still are) some IC fabs in Europe, but mostly older nodes (like useful for microcontrollers used by car manufacturers. Wikipedia has a list). So for SOTA smartphone SoCs it's off to Taiwan (TSMC), South Korea (Samsung) or China (who makes everything, including smartphones & the chips going in there).<p>So as far as EU goes, the capabilities are mostly there. Skilled workforce? Check. Money? This is a rich continent.<p>What's missing is the guts to say "hey, let's dump €100B into this & make ourselves some laptop & server CPUs!".<p>But now the important thing: several of such initiatives are starting to bear fruit, and b) confidence that EU <i>can</i> do such things, is growing.<p>As for bureaucracy / red tape... sigh... (won't be fixed any time soon)
>>ASML only makes the lithography machines<p>Woah! only lithography machines???? it is literally impossible to make any device capable of running anything close to AI without ASML. Let that sink in.
Funnily ASML owes its current success partly to US funded research (straight from Wikipedia):<p>> Two years later, it joined a consortium, which included Intel and two other U.S. chipmakers, in order to exploit fundamental research conducted by the US Department of Energy. Because the Cooperative Research and Development Agreement (CRADA) it operates under is funded by the US government, licensing must be approved by Congress.[12]
Why are you acting childish and petty? I said EU hasn't got AI compute manufacturing(aka no equivalent IP to Nvidia and AMD and no equivalent to TSMC or Samsung fabs), not that it doesn't have lithography machines manufacturing.<p>Surely you understand that while you can have the latter, you can also lack the former.
The World's Most Important Machine ;)<p><a href="https://www.youtube.com/watch?v=MiUHjLxm3V0" rel="nofollow">https://www.youtube.com/watch?v=MiUHjLxm3V0</a>
Most important machine ... built on US IP, subject to US export restrictions, used to manufacture high value US IP, in factories outside the EU, so profits of those chips goes to US. A point I have addressed over two times already.<p>Also ASML even threatened to leave the NL if the Dutch government doesn't do what they want on taxes and labor policies. So having only a single card to play that EU can loose at any time, it's not putting EU tech sovereignty argument in a good light.<p>The "wahabout ASML" that keeps being spammed by people here, isn't proof of EU compute and AI sovereignty. It's the exception which is why it's the only thing people can mention on EU tech and they DDoS you with it as if that changes anything.<p>Are people here that petty that they can't stay on topic and argue in good faith and instead need to hijack your argument to go on offtopic whataboutism for a cheap gotcha spamming "whatabout ASML" on unrelated arguments?
>ASML only makes the shovel making machines
>>Yeah but Europe doesn't build any computer hardware,<p>Well, then this is will be a good start.
EU bureaucrats are too busy trying to keep the welfare/pension system from collapsing, defeating Russia, supporting Ukraine, managing the fossil fuels energy shortages, figuring out how to nerf Chinese EVs while supporting domestic car companies, and restricting social media free speech to make sure the "far right" don't win elections.<p>So of course, semiconductor manufacturing sovereignty is very low on their priority list.
How many in that list of things is the US also doing?
The US is a single country. Russia is not on the US' doorstep. The US has its own oil. The US prints the world reserve currency.
Different scale for those problems. Way different scale. US is monetary rich, oil rich, energy rich, manufacturing rich, and doesn't suffer from russian aggression at its borders. US is so bored from how rich and problem free it is compared to Europe, that it can afford to keep starting foreign wars as if nothing ever happens.<p>Also back on the topic, the US managed to bring TSMC to open a cutting edge fab in the US and has already been operational for a while. Which already puts it way ahead of the EU on this front as well.<p>The thing is, US is much better on actually making things happen when push comes to shove. It saw it's deficient and vulnerable on domestic semiconductor manufacturing, it then made it happen with TSMC. It's doing the same with domestic ship building with Korean partners.<p>US might be slow moving, but somehow EU is even way slower at realizing and addressing its vulnerabilities, only waking up when it's far too late, causing it to pay a much more painful price for sleeping at the wheel (Russian invaded Ukraine in 2014 BTW, not in 2022, and they were building another gas pipeline with them), and when this type of own-goaling keeps repeating enough times you see the correlation with EU's decline as their economic rivals keep biting more and more market share from their industries as they sleep on critical changes and developments.
Anyone who uses quotation marks around far right has clearly stated their allegiance.<p>It is no wonder then that such a person would do their best to poo-poo the worlds most successful peace project and the bastion of rule of law.
That's what bureaucrats are supposed to be doing BTW.
And yet, the ESMC Dresden fab is getting built. And now there's a Chips Act 2.0.
Previously posted on 02-dec-2023 <a href="https://news.ycombinator.com/item?id=38497495">https://news.ycombinator.com/item?id=38497495</a> 3 comments
They’re building a competitive-quality model, from scratch, with fair compensation to content owners, for €13.5 million? Something’s wrong with this picture.
I feel that not only is Europe losing its independence to the US and China, but it does not even try to take part in the race.<p>Unlike the US, Europe has no California-level VCs. I don't expect hundreds of billions of Euros to be poured into long-shot projects.<p>Unlike China, Europe has neither cohesive public investment at the global level nor the drive to grow. Long-term investments have a lot of words, a lot of regulations, a lot of proxy goals, but there is neither a lot of money nor urgency. It was captured by this post: <a href="https://x.com/piotrsankowski/status/2065795919623438546" rel="nofollow">https://x.com/piotrsankowski/status/2065795919623438546</a><p>So yeah, both in economy and warfare, Europe dooms itself to be in the hands of the US, China, or a mix of both.
> Unlike the US, Europe has no California-level VCs.<p>Some would consider that a good thing. There is a lot to be said for VC in recent years not being beneficial for the economy, certainly on an individual level, other than "number go up".
Sure.<p>At the same time, it made in many cases EU dependent on the US. A lot of governments are basically dependent on MS Office or Google Cloud.<p>With AI, it is even more strategic.
My impression is that in Europe much fewer people are convinced that AI-maxxing is necessary or even a net benefit.
And if you ask a bit more, you'll find that those same people are very likely to daily-drive AI models, desktop and phone operating systems and various other software critical to their professional and personal lives from US companies. And buy tons of Chinese products over Chinese or US e-commerce platforms.<p>What people say matters much less than what they do.
Much fewer people in Europe are convinced maxxing anything besides work/life/life balance and generous social support are a benefit.<p>All the stuff that doesn't help an economy grow or pay for the future.
I have rather deep concerns about our future with AI.<p>Yet, I'm still sending hundreds of dollars to US companies providing it. I'd much rather send it to EU companies.
Americans are also far less into AI than a startup forum like HN will make you believe.<p>In real life, there are a few AI maniacs that make their entire identity about how they use AI, but it's hardly the sensation that the internet will make you believe. I don't believe there is any difference between the USA and Europe, although the lack of employee protection does mean that it's a lot easier for Americans to lose their jobs when their managers get lured in by AI companies.<p>After all, the entire AI bubble is all about VCs and startups hyping each other up until profits magically appear.
Which has nothing to do with VCs, just with sourcing decisions.
If there were European MicroSoft or Google, there would be a preference.
Well those companies exist because of VC's, and the nation's business freedom compared to Europe
> There is a lot to be said for VC in recent years not being beneficial for the economy<p>What a wild statement, VC's are behind most of the growth in the US economy, and they directly drive up wages in tech. I'd be fascinated to hear a valid complaint of VC's that isn't just money envy
Some people consider it a good thing that communists boiled people's hands as torture. Some people consider it a good think that Iran massacred 10,000s of its own citizens. Some would consider it a good thing that Israel killed all Palestinians in Gaza.
Europe decided to regulate the hell out of foreign AI instead of investing in their own systems. It's sad to see the European continent lost the race to create a decent startup ecosystem (no decent search engines, social networks, cloud, mobile OS) and now it seems to be hellbent in losing this battle.
<i>>It's sad to see the European continent lost the race to create a decent startup ecosystem</i><p>What's ironic and sad at the same time is that pre-2022 Russia's Yandex(domestic Russian variant of Google) was lightyears ahead of what EU, a significantly richer and more capable block, had. IIRC, their reverse image search was so good, they had to nerf it because people were using it to find the identity of people from photos.<p>Same for Israel, their tech sector is probably greater than the EU one combined<p>Absolutely shameful how the EU kept managing to snatch defeat from the jaws of victory over and over.
Not surprising. All the Yandex people that moved over to here to the Netherlands that I know of are astounded about the insane difference in the tax burden between what they had in Russia and what they have in Western Europe (when their 5 year tax discount ends, that is). If the government takes the bulk of your income after a certain point, there isn't really that big of a push to create ground-breaking technology.
> If the government takes the bulk of your income after a certain point, there isn't really that big of a push to create ground-breaking technology.<p>I'm skeptical that high taxes is a large reason to lose to California of all places. Maybe in some important sense CA has "earned" that via talent and funding density while NL hasn't (from the perspective of a company, to be clear)
NL is a corporate tax haven in Europe(and also compared to California), that's why almost all US and foreign companies have their corporate HQs there, Dublin or Luxembourg.<p>Companies go there because taxes are low for them, not necessarily for their employees(ignoring the NL 5 year tax break for foreigners). It's kinda like the US Delaware of Europe.
We also have much nicer societies than Russia. It's a dictatorship ffs.<p>And yes having nice things cost money. And a safety net is important.<p>I would never want to live in America even if I got 3x my wage. Nor Russia of course but that's a foregone conclusion.
I think much of that is because European customers (both private and business) tended to prefer American suppliers over suppliers from European countries that were not their own. That may have something to do with most people in IT being quite fluent in English, while European products were all-to-often half-heartedly translated from German/French/Spanish/Polish/Italian/Ukranian.<p>In many cases, well-established and well-liked European services have been supplanted by American counterparts that came later and were not really better in any way. They did usually have much more money to burn though, undercutting pricing until competition was dead.<p>I'm speaking in the past tense, because now for the first time in the couple of decades I can remember, there seems to be a somewhat commonly held preference for European suppliers.
Frankly, being Russian myself, I'm also very disappointed by the state of tech in Europe.<p>But you know what hurts the most? That <i>I know it wasn't always that way</i>.<p>I'm sitting right now in the same country that invented the Minitel, built out the TGV network and the Grands Projets, and don't even get me started about the weird and wonderful machines they've got in that museum in Mulhouse, hell, you could go back in time to Gustave Eiffel. Industry and ambition used to be here. It was almost physically painful to discover that it seems to be gone now.
You are saying that as if China or the US are completely isolated from the EU.
We live in a globalized world whether you like it or not, and every supply chain spans multiple countries.<p>Arguably, staying out of the AI "race" is a good thing
I'll play devil's advocate a little bit - I'm not sure it is losing its "independence" by not taking part in the race. It could very well be that it is gaining independence from tech and choosing a "second mover advantage" to decide how it gets deployed after seeing how it impacts everyone else. Let the US and China experiment on the bleeding edge (and their citizens feel the effect, both good and bad), and then be picky about how you use it.<p>I don't know if it is the right strategy but there's certainly a legitimate strategy in there.
The problem is recursive self improvement creating a very difficult gap and the fact that power, compute has a lag from when you invest and when data centers come up.<p>You also can't just spin up a research team out of nowhere.
These things are true, however:<p>1. The labs in the US and China don't seem to have any problem selling (or even giving) access to these models right now.<p>2. If some kind of take-off happens which makes that not true, my bet is all bets are off on what that outcome even looks like. What would the economic paradigm even be under a superintelligent AGI? Do you think "it" is going to listen when Trump says "you can't work with Europe"?<p>There's a whole bunch of grey in between the two, for example only having access to second rate models, but I'm not sure that particularly matters if the strategy is "second mover."
Let’s autonomous Russian drones, and Europe is at mercy of two other empires, who capitalize on this opportunity.
Serious question: what does any of that have to do with the submitted article? Where is the relevance to the topic at hand?
Europe is not a country.<p>Regulations are not even throughout each of the 27 member states. Each country is relatively small in the world stage.<p>Until EU progresses towards federalization, discussing this is a moot point.
> Unlike the US, Europe has no California-level VCs. I don't expect hundreds of billions of Euros to be poured into long-shot projects.<p>My ex-neighbor (when I was a teenager, living in Belgium) and very good friend really wanted to make it big. He became a chip engineer, moved to California, raised money for a first startup (it tanked) then raised money for a second startup. He made the world a better place (he created some very specific micro-inverters for solar panels) and made a $$$ exit.<p>The EU saw exactly zero of the wealth he created and he's never ever coming back to what he considers a failure of a continent.<p>That's the problem: many of the great minds with the mindset required to do great things already left the EU.<p>> So yeah, both in economy and warfare, Europe dooms itself to be in the hands of the US, China, or a mix of both.<p>And in energy (economy is energy and energy is economy, and China really understood that) the EU doomed itself to be in the hands of Russia.<p>We <i>are</i> a failure of sinking continent.
> economy is energy and energy is economy, and China really understood that<p><i>In former times the energy monopoly was called "The Power Company"; we intend to give this name an entirely new meaning."</i><p><i>– CEO Nwabudike Morgan, "The Centauri Monopoly"</i>
Europe is a great place to live if you just want to float through life.<p>The US is a great place to live if you have talent, want to work, and want to reap the rewards.
This is not an open source model. In that sense I think the sovereign claim is a bit strange. It's the data providers that determine access to the model.
> A total of €13.5 million has been allocated to the project.<p>> This public investment underlines the importance of an independent, trustworthy and future‑proof Dutch language model.<p>It does, but not in the way you think it does.
A total of €13.5M has been allocated to the project.<p>I guess we’re going for GPT2 level capability?
We already had GEITje but it was banned by the courts. Of course it can still be found because the entire internet is not subject to Dutch law. But it did manage to stop development :'(
I wonder with these stories. Why are there so many individual country efforts? We know the scale needed with scaling laws / capital / energy. Most of these countries alone can barely compete (even large groups of them would struggle.<p>Why don't they work together on it? Companies like Airbus have already been able to do that with aircraft.
What really matters is the sovereign capability to finetune the LLM models. Any model could be vetted and tested, but you need finetuning/lora training to prevent the model from being outdated.
I'm making a Dutch dictionary and would be interested to see how this model would fair in evals vs non specialized ones. I've tested a variety of models for <a href="https://hetnederlands.com" rel="nofollow">https://hetnederlands.com</a> content and differences can be big
Is it a proposal or a model?
And if it is a model, how fies it fare on benchmarks?
Burning tax money. I dare to bet this will never lead anywhere.
Honestly, I used to think the 'sovereign model' was a waste of money. But recently, with the US logic of restricting model exports, I've come to think that if things go south, they could even cut off allied nations. So now the sovereign model seems reasonable to me. That, in turn, means US influence is deteriorating. And that probably isn't such great news for American businesses.
What are they going to train with 13.5M really? We're a tiny company in Amsterdam in Holland and we've got "only 64x B300 to train on" so we could never make an LLM I thought, since we've got only 4M in compute.<p>And they're going to train an LLM with all kinds of extra difficulties compared to OpenAI for just 13.5M?<p>The very first Llama was 16M for one training.
This is too little, too late. Europe really need to start focussing.<p>All these tiny niche models are perhaps fun as an academic exercise or great for the researchers resume but I highly doubt that they'll add any value or will be used for anything serious.<p>Even if this becomes a somewhat decent model with a fantastic understanding of "gezellig", "kring verjaardag" or "pannenkoeken", how many people will interact with it before the limits of it will drive them back to a frontier model?<p>Even if the purpose of this is government & other regulated industries, do we really want our government to use a poor model? Either do it right or don't do it at all.
Prices for training have dropped immensely in terms of research required, code efficiency, algorithmic/sample efficiency, and possibly also hardware (I'm not qualified to say without looking it FLOPS/dollar, or even to be certain that's the right metric here).
I really think countries should build a sovereign _ecosystem_ and sovereign models are an excuse to achieve it.<p>An ecosystem is the tribal knowledge, revolving door of talent, known processes etc.<p>If the end goal is to make a half assed Dutch speaking model, I think it won’t cut it. I don’t see anyone using it over Gemma 4b that runs on my laptop.<p>An ecosystem is more durable and has desirable second order effects.
How do you use it?
So cute.
Supposedly this model also aims to treat publishers of all sizes well. Looking forward to its launch soon :)
[dead]
[flagged]
This really isn’t what HN is for. I think you should read the guidelines. I’ve looked at your posting history and it is all like this: relentless negativity with low information content.
Why are you being nasty? And in what world is the Netherlands a second-world nation?
> Excluding harmful content<p>#define(HARMFUL)<p>[edit] Downvoters please tell me what the problem is with specifying this?
I didn't down vote you, but you aren't really adding anything to the conversation. This type of pithy comment might be fun on Reddit, but at HN, we try to provide more constructive, and information rich, comments.
How about fixing whatever the hell prevents competitive private LLM vendors from appearing in Europe?
They’re still debating that, they’ll get back to us soon I’m sure.
you mean getting rid of checks and balances, environment protection laws, anti-corruption laws?
> A total of €13.5 million has been allocated to the project.<p>This is not even funny. If you want a competitive AI industry, you need to invest much more heavily in infrastructure first, building models second.