Missional AI

This repository contains Jupyter notebooks showing how to use MACULA Greek New Testament data. These notebooks were prepared for a tutorial at the 2023 Global Missional AI Summit on "Greek and Hebrew Datasets for Natural Language Processing". The tutorial was led by Jonathan Robie, Sean Boisen, and Randall Tan of Clear Bible, Inc.

Quick Start

New to Colab/Jupyter Notebooks: If you have never used Google Colaboratory or Jupyter notebooks, check out the Getting Started tutorial, and click the 'Open in Colab' button at the top of the file.

Experienced with Colab/Jupyter Notebooks: If you have used notebooks like this before, but you are new to MACULA Greek and Hebrew data, head to MACULA Data Overview to get started.

Tutorial Abstract

The original Greek and Hebrew texts are at the heart of Bible translation, and they have been analyzed by many different researchers in every conceivable way. But most NLP practitioners do not know Hebrew or Greek. MACULA is a set of linguistic datasets that describe the original Hebrew and Greek texts that are at the heart of Bible translation.

Using English glosses, semantic domains, and various descriptions of the text, they can be used by NLP practitioners without knowledge of the original languages. These datasets were developed by Clear Bible, United Bible Societies, SIL International, unfoldingWord, Translatable Exegetical Tools, Faith Comes by Hearing, the Groves Center, OpenScriptures, Cherith Analytics, and others, and they have been integrated to work together.

In this workshop, we will use Google Colab notebooks to show how to use this data for specific tasks, then demonstrate some useful NLP tasks such as exploratory data analysis, topic modeling, identifying important vocabulary in a passage using TF-IDF, and text summarization.

Participants will be encouraged to work at their own pace and ask questions. They are also welcome to work on their own projects using this data, or build on the notebooks we present.

Acknowledgements

The notebooks were created by Ryder Wishart and Nathan Brock.

License

All code in this repository is released under MIT License. For data licensing, see the data README.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
data		data
.gitignore		.gitignore
00_getting_started.ipynb		00_getting_started.ipynb
01_macula_data_overview.ipynb		01_macula_data_overview.ipynb
02_semantic_domains_overview.ipynb		02_semantic_domains_overview.ipynb
03_domain_topic_modelling.ipynb		03_domain_topic_modelling.ipynb
04_levinsohn_discourse_features.ipynb		04_levinsohn_discourse_features.ipynb
05_syntax_knowledge_graph.ipynb		05_syntax_knowledge_graph.ipynb
06_macula_greek_pandas.ipynb		06_macula_greek_pandas.ipynb
07_Alignments.ipynb		07_Alignments.ipynb
LICENSE.md		LICENSE.md
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Missional AI

Quick Start

Tutorial Abstract

Acknowledgements

License

About

Releases

Packages

Contributors 3

Languages

License

Clear-Bible/missional-ai

Folders and files

Latest commit

History

Repository files navigation

Missional AI

Quick Start

Tutorial Abstract

Acknowledgements

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages