Skip to content
Change the repository type filter

All

    Repositories list

    • Testun I Leferydd Techiaith. // Techiaith Text To Speech.
      Python
      MIT License
      0002Updated Dec 23, 2024Dec 23, 2024
    • Corpws o frawddegau CC0 mewn fformat jsonl, gyda rhannau ymadrodd y tocynnau (geiriau etc.) wedi'u tagio â thagiau Universal Dependencies. | A Corpus of CC0 sentences in the jsonl format, tagged with Universal Dependency part-of-speech tags.
      Creative Commons Zero v1.0 Universal
      0300Updated Nov 10, 2024Nov 10, 2024
    • Prototeip o leisiau dwyieithog Cymraeg-Saesneg all-lein [Piper](https://github.com/rhasspy/piper) ar gyfer iOS. // Prototype of bilingual Welsh-English offline voices [Piper](https://github.com/rhasspy/piper) for iOS.
      Swift
      GNU General Public License v2.0
      5000Updated Oct 22, 2024Oct 22, 2024
    • Brawddegau Prawf Tafodieithol TTS // TTS Dialectical Test Sentences
      Creative Commons Zero v1.0 Universal
      0000Updated Oct 16, 2024Oct 16, 2024
    • Rhestr o ataleiriau Cymraeg | Welsh Stopwords List
      Creative Commons Zero v1.0 Universal
      0100Updated Sep 11, 2024Sep 11, 2024
    • Yn y demo hwn, byddwn yn gweld sut i adeiladu cymhwysiad a fydd yn derbyn sain ffrydio a anfonir trwy Websockets a chael y sain wedi'i thrawsgrifio gan ddefnyddio OpenAI Whisper. // In this demo, we will see how to build an application that will accept streaming audio sent via Websockets and have the audio transcribed using OpenAI Whisper.
      TypeScript
      MIT No Attribution
      1000Updated Aug 8, 2024Aug 8, 2024
    • Casgliad cychwynnol o URLs sy'n cynnwys testun Cymraeg / An initial collection of URLs contaning Welsh-language texts
      Creative Commons Zero v1.0 Universal
      0000Updated Jul 31, 2024Jul 31, 2024
    • Corpws o sgyrsiau cymorth Cysgliad | A Corpus of support chat messages for the Cysgliad software
      Creative Commons Zero v1.0 Universal
      0000Updated Jul 29, 2024Jul 29, 2024
    • Cod gwefan Trawsgrifiwr Ar-lein gan Uned Technolegau Iaith, Prifysgol Bangor // // The code for the Trawsgrifiwr Ar-lein website by the Language Technologies Unit, Bangor University
      JavaScript
      MIT License
      1210Updated Jul 25, 2024Jul 25, 2024
    • Lecsicon cynhwysfawr o eirffurfiau'r Gymraeg yn seiliedig ar ddata gwirydd sillafu a gramadeg Cysill | A comprehensive lexicon of Welsh-language wordforms based on data from the Cysill spelling and grammar checker
      Creative Commons Zero v1.0 Universal
      2811Updated Jun 14, 2024Jun 14, 2024
    • Gweinydd syml ar gyfer ddarparu gwasanaeth API at modelau adnabod lleferydd DeepSpeech // Simple server for providing API access to DeepSpeech speech recognition models.
      Python
      MIT License
      2001Updated Apr 16, 2024Apr 16, 2024
    • Parsiwr dibyniaethau sy'n ceisio gwahaniaethu rhwng defnydd enwol a berfol o'r berfenw // A dependency parser which attempts to differentiate between nominal and verbal verbnouns
      Creative Commons Attribution Share Alike 4.0 International
      0000Updated Apr 11, 2024Apr 11, 2024
    • Tagiwr arbrofol dwieithog ar gyfer testunau Cymraeg a Saesneg | An experimental bilingual tagger for English and Welsh texts
      0000Updated Apr 11, 2024Apr 11, 2024
    • piper-cy

      Public
      Lleisiau all-lein Cymraeg || Welsh offline voices
      Python
      MIT License
      0100Updated Apr 3, 2024Apr 3, 2024
    • Corpws o frawddegau o destun Cymraeg wedi'u trwyddedu o dan drwydded CC0 | A corpus of Welsh texts licensed under the CC0 licence
      Creative Commons Zero v1.0 Universal
      0100Updated Mar 31, 2024Mar 31, 2024
    • Fersiwn wedi'i becynnu o spacy-lookups-data gyda data lemateiddio Cymraeg | A packaged version of spacy-lookups-data including Welsh lemmatization data
      MIT License
      0000Updated Mar 31, 2024Mar 31, 2024
    • deffro

      Public
      Project i greu modelau bychain adnabod lleferydd sy'n deffro ar air neu ymadrodd benodol. // A project to create small speech recognition models that wake up on a specific word or phrase.
      Python
      0000Updated Mar 26, 2024Mar 26, 2024
    • Anonymeiddiwr Beta ar gyfer testunau dwyieithog Saesneg-Cymraeg a thestunau Cymraeg uniaith.
      Python
      MIT License
      0000Updated Mar 21, 2024Mar 21, 2024
    • Trawsgrifio ar gael drwy’r eicon microffon o fewn bysellfwrdd arferol ffon symudol
      Java
      MIT License
      0000Updated Mar 21, 2024Mar 21, 2024
    • Rhedeg modelau adnabod lleferydd Cymraeg Whisper all-lein gyda C/C++
      C
      MIT License
      3.7k000Updated Mar 21, 2024Mar 21, 2024
    • Gweinydd gwasanaeth atgyweirio priflythrennau ac atalnodi o fewn testunau Cymraeg // Capitalization and Punctuation restoration for Welsh language texts
      Python
      MIT License
      0000Updated Mar 21, 2024Mar 21, 2024
    • sense2vec

      Public
      🦆 Contextually-keyed word vectors
      Python
      MIT License
      240000Updated Mar 17, 2024Mar 17, 2024
    • Meddalwedd ac offer docker i weithio gyda Marian NMT | Software and tools for working with Marian NMT
      Python
      MIT License
      1200Updated Feb 28, 2024Feb 28, 2024
    • Demo o fodelu pwnc
      Python
      MIT License
      0000Updated Jan 12, 2024Jan 12, 2024
    • Casgliad o brofion Cymraeg ar gyfer modelau iaith mawr (llm) // A collection of Welsh language evals for large language models
      Shell
      MIT License
      0000Updated Nov 30, 2023Nov 30, 2023
    • Fersiwn wedi'i ddiweddaru o'r fersiwn Cymraeg o wirydd sillafu Hunspell. | An updated version of the Welsh version of the Hunspell spellchecker.
      Other
      0510Updated Nov 7, 2023Nov 7, 2023
    • Model Iaith Fectorau Word2vec ar sail corpora ymchwil yr Uned Technolegau Iaith a gasglwyd o ffynonellau amrywiol at ddibenion ymchwil fel cynhyrchu modelau iaith. | A Word2vec Language Model based on the Language Technologies Unit's research corpora.
      Python
      Apache License 2.0
      0100Updated Oct 31, 2023Oct 31, 2023
    • Fersiwn Cymraeg llafar o wirydd sillafu Hunspell. | Spoken Welsh version of the Hunspell spellchecker.
      Other
      0100Updated Oct 31, 2023Oct 31, 2023
    • Mae'r meddalwedd yma yn ceisio drawsgrifio'n fyw unrhyw leferydd Cymraeg a ddaw drwy seinydd eich cyfrifiadur Windows.//The code in this repository allows a locally installed speech recognition engine to transcribe any speech from the loudspeaker of your Windows PC.
      C#
      MIT License
      0000Updated Oct 31, 2023Oct 31, 2023
    • spacy

      Public
      Mae spaCy yn llyfrgell ar gyfer Prosesu Iaith Naturiol uwch yn Python a Cython. // spaCy is a library for advanced Natural Language Processing in Python and Cython.
      Python
      MIT License
      0000Updated Sep 11, 2023Sep 11, 2023