split_query.py

View code on Github

Split forms into groups of up to six forms per query based on identifiers.

scribe_data.check.check_missing_forms.split_query.split_group_by_identifier(language_entry, output_dir, sub_lang_iso_code=None)[source]

Split forms into groups of up to six forms per query based on identifiers.

Parameters:
language_entrydict

Dictionary containing language data with missing features. Format: {language_qid: {data_type_qid: [features]}}.

output_dirstr or Path

Directory where generated query files should be saved.

sub_lang_iso_codestr, optional

ISO code for sub-language if applicable.

Notes

Groups forms based on their identifiers to avoid generating too many queries. Combines small groups when possible to reduce the number of query files.