"No longer an American John trying to speak Slovenian, but our Slovenian John"
Slovenia · 💻 Technology · 4 days ago

Slovenia has developed an open-source large language model called GaMS, designed specifically for the Slovenian language. The project, led by the University of Ljubljana's Center for Language Resources and Technologies under Simon Krek's guidance, aimed to create a foundation for artificial intelligence development in Slovenian while enhancing technological sovereignty. The model was trained on extensive collections of Slovenian texts and is now available for use in research, education, industry, and public administration. It supports longer documents and is culturally adapted to understand local expressions, references, and context. The project involved two phases: building infrastructure through data collection and model development, followed by practical applications in collaboration with companies.

In recent years Slovenia has invested heavily in artificial intelligence (AI), with a particular focus on language models for Slovenian. The centerpiece of this effort is the **PoVeJMo** project, which produced the large language model **GaMS**, the first open-source model of its size class for Slovenian. The project spanned four years, from 2023 to 2026, and was led by the Center for Language Resources and Technologies of the University of Ljubljana under Simon Krek. Its goal was not only to build the model but also to establish core infrastructure for AI development in Slovenian, which has significant implications for the country's technological sovereignty.

Development proceeded in two phases. The first covered collecting Slovenian texts, preparing datasets, and developing the model itself. Because this required enormous computing power, the team used the Slovenian supercomputer **Vega** as well as the European supercomputer **Leonardo** in Bologna. The second phase focused on practical applications: participating companies adapted the model to concrete business needs. As a result, GaMS is not just a theoretical model but a usable tool for sectors such as healthcare, industry, cultural heritage, and IT.

The results were presented at the project's closing event, held at the Faculty of Computer and Information Science of the University of Ljubljana. GaMS is culturally adapted to Slovenian, meaning it better understands Slovenian ways of expression, local references, and cultural context. Lead developer Domen Vreš emphasized that the model can understand and generate long documents, which makes it suitable for research, educational, and commercial use. Its adaptability has also enabled autonomous task execution and further training within the Slovenian AI Factory.

Companies including **Better**, **Vitasis**, **Špica**, **Semantika**, and **XLab** helped adapt the model to specific needs. Better, for example, developed a solution that automatically produces structured medical documentation from a conversation between doctor and patient. Vitasis developed Slovenian speech recognition models adapted to medical and industrial settings. Špica adapted the model for voice control of warehouse and production processes in several languages, Semantika for preparing multilingual museum content and interactive presentations of cultural heritage, and XLab for generating descriptions of computing infrastructure in program code.

Further development will add capabilities that let the model work more autonomously, along with adaptations for the needs of individual sectors. University of Ljubljana Vice-Rector M. noted that the new supercomputing system **Frida**, recently launched at the faculty, will enable even better development of the model. In this way Slovenia is gradually joining the world of artificial intelligence, with a commendable approach to its own language and culture.

2 reports

24ur (POP TV) · Independent · Center · 4 days ago
What will happen to the Slovenian language in the age of artificial intelligence?

The article discusses Slovenia's efforts to preserve and develop its language in the age of artificial intelligence. It highlights the project 'Povejmo' at the Faculty of Computer Science, which invites institutions to contribute texts to further develop a Slovene language model called GaMS. The goal is to ensure that Slovene remains fully functional and relevant even with the rise of AI technology. The piece emphasizes the importance of maintaining linguistic heritage in the face of technological change.

Bias read (Center): The article presents a balanced discussion of the role of AI in language preservation without overtly favoring any political ideology. It focuses on academic and institutional efforts rather than partisan agendas, though the topic itself has some political relevance due to national identity concerns.

RTV Slovenija (MMC) · State / Public · Center · 4 days ago
"No longer an American John trying to speak Slovenian, but our Slovenian John"

Bias read (Center): The article focuses on the technical development of a language model for Slovenian, emphasizing its applications in various sectors. There is no political framing, controversy, or ideological emphasis present in the content.
