Skip to content

Simple one-off script to publish Estonian language lexemes at Wikidata

Notifications You must be signed in to change notification settings

62mkv/estonian-forms

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

51 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Things to consider

  1. Some lexemes have multiple paradigms. This is defined as combination of "declination type" + "option_number" in the fmsynth output
  2. Currently, option number is ignored during import, which is bad. it has to be added, with a caveat:
    1. For some lexemes, fmsynth will report same paradigm twice (see "saks"). Duplicated paradigms must be ignored while imported.
  3. From the article, "initial forms" could be parsed to compare with fmsynth-provided paradigms, and non-compliant paradigms must be ignored.
  4. This should provide for much safer articleForm sets.

Running in IntelliJ IDEA

In order to run JFX application in IntelliJ IDEA Community:

  • download JavaFX SDK into some folder
  • create "Application" type Run Configuration
  • add VM options: --module-path /path/to/javafx-sdk-21.0.x/lib --add-modules javafx.controls,javafx.fxml

About

Simple one-off script to publish Estonian language lexemes at Wikidata

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages