Skip to content

In an attempt to prove that Finnish is a random compilation of letters and occasional doubling of certain letters, I will try to randomly generate words and compare them to my Finnish database. But first I needed to do some statistics on the Finnish language regarding the occurence of characters in words. I was lucky to find a generous database …

Notifications You must be signed in to change notification settings

NassarNa/AlternativeFinnish

Repository files navigation

AlternativeFinnish (WIP)

In an attempt to prove/disprove that Finnish is a random compilation of letters and occasional doubling of certain letters, I will try to randomly generate words and compare them to a Finnish database which I found online.

Weighted randomness will be based on actual statistics derived from the real database. Therefore I needed to do some statistics on the Finnish language regarding the occurence of letters/characters in words. I was lucky to find a generous database of over 94000 words. I am aware that some words are not Finnish, but are used in the Finnish language.

#Sources: #https://kaino.kotus.fi/sanat/nykysuomi/ (My current database bible)

I edited the database to my needs for my initial analysis using Excel and simple commands (see Finnish_wordlist_edited_v003.xlsx) My first statistical results using python script Finnish_Alternative_ver001_002_GitHub.py (see Finnish_Alphabet_FirstResults.xlsx version 26.05.2022)

More to follow...

Disclaimer: I don't speak or understand any Finnish (at least for now), this all started when I was driving with a colleague of mine through Switzerland and a Finnish song was playing on the radio. As I read the title I couldn't resist cracking a joke. What started as a bad joke became a challenge and a new project to practice my hobby in coding with python. If you're Finnish and/or human you shouldn't feel offended. When/If I finish my project it should be applicable to any language provided a similar database. And then we could mock Swedish and prove it is random too.

About

In an attempt to prove that Finnish is a random compilation of letters and occasional doubling of certain letters, I will try to randomly generate words and compare them to my Finnish database. But first I needed to do some statistics on the Finnish language regarding the occurence of characters in words. I was lucky to find a generous database …

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages