resources

Repository of resources

This section contains the following:

Converter: The Jupyter notebooks used for the two-step conversion (XML -> pickle -> TF).
Sourcedata: The various versions of the XML data, used as input to the first step of the conversion (XML -> pickle).
Picklefiles: The various versions of the zipped pickle files (the output of step 1), used for creating the Text-Fabric files.
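
As a rough illustration of the first conversion step (XML -> pickle), the sketch below parses a tiny XML fragment, flattens it into a list of records, and round-trips that list through pickle. It is not the code from the converter notebooks: the `<w>` element, its `id` attribute, the sample text, and all function names are hypothetical, since the actual GBI XML schema is not described here. The second step (pickle -> TF) is not shown.

```python
import pickle
import xml.etree.ElementTree as ET

# Hypothetical sample input; the real source XML will have a different schema.
SAMPLE_XML = '<book><w id="1">Βίβλος</w><w id="2">γενέσεως</w></book>'


def xml_to_records(xml_text):
    """Flatten every <w> element into a dict of its attributes plus its text."""
    root = ET.fromstring(xml_text)
    return [{**w.attrib, "text": w.text} for w in root.iter("w")]


def records_to_pickle(records):
    """Serialize the records to bytes (standing in for the step 1 output)."""
    return pickle.dumps(records)


def pickle_to_records(data):
    """Load the records back, as a second step would before building TF files."""
    return pickle.loads(data)


records = xml_to_records(SAMPLE_XML)
blob = records_to_pickle(records)
restored = pickle_to_records(blob)
```

Keeping an intermediate pickle file means the XML only has to be parsed once; later steps can start from the already-flattened records.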

This directory also contains a few Jupyter notebooks for handling the source data. The following notebooks are not directly related to the creation of the Text-Fabric dataset, but were added to analyse some aspects of the GBI source data.

converter
images
picklefiles
sourcedata
CompareTwoXMLfiles.ipynb
README.md
calculate_PCFG_parameters.ipynb
calculate_SVO-VSO-etc.ipynb
differences_word_normalized.ipynb
duplicate.ipynb
identify_punctuations.ipynb
identifying_critical_signs.ipynb
unicode_normalized_tagvalue_comparison.ipynb
