generate_query.py
Generate SPARQL queries for missing lexeme forms.
- scribe_data.check.check_missing_forms.generate_query.get_available_filename(base_path)[source]
Find the next available filename by incrementing counter if file exists.
- Parameters:
- base_pathstr
Base path for the query file.
- Returns:
- str
Available filename that doesn’t conflict with existing files.
Examples
- If no files exist:
Returns query_{data_type}.sparql
- If query_{data_type}.sparql exists:
Renames existing query_{data_type}.sparql to query_{data_type}_1.sparql
Returns query_{data_type}_2.sparql
- If last file is query_{data_type}_N.sparql:
Returns query_{data_type}_(N+1).sparql
- scribe_data.check.check_missing_forms.generate_query.generate_query(missing_features, query_dir=None, sub_lang_iso_code=None)[source]
Generate SPARQL queries for missing lexeme forms.
- Parameters:
- missing_featuresdict
Dictionary containing missing features by language and data type. Format: {language_qid: {data_type_qid: [[form_qids]]}}.
- query_dirstr or Path, optional
Directory where query files should be saved. If None, uses default language_data_extraction directory.
- sub_lang_iso_codestr
The ISO-2 code of a sub-language if there is one being provided.
- Returns:
- str
Path to the generated query file.
Notes
Generates a single query file combining all forms for a given language and data type combination.
Query files are named incrementally if duplicates exist.
Creates necessary directories if they don’t exist.