language_data_extraction/

View code on Github

This directory contains all language extraction and formatting code for Scribe-Data. The structure is broken down by language, with each language sub-directory then including directories for nouns, prepositions, translations and verbs if needed. Within these data type directories are query_DATA_TYPE.sparql SPARQL files that are ran to query Wikidata and then formatted with the given format_DATA_TYPE.py Python files.

Use the View code on GitHub link above to view the directory and explore the process!