TEI format conversions for spoken and multimodal data
TEI (xml / tei_corpo.xml / teiml / trjs)
TRS (Transcriber)
CHA (CHAT - CHILDES)
TXT (text - UTF-8)
DOCX (Microsoft Word)
XLSX (Microsoft Excel)
CSV (spreadsheets)
TEXTGRID (Praat)
EAF (ELAN)
TXM (xml/w)
Lexico/Le Trameur (.txt)
Remove these tiers from the output
Name of tiers (wildcard characters can be used)
Remove specific marks for spoken language
One line per utterance + tiers/sub-tiers to the right
One line per utterance + tiers/sub-tiers to the right organized by columns
One line per utterance, tiers/sub-tiers below, organized by tier names
One line per utterance, tiers/dependencies below, one element per line
Number of decimal digits for time values
Text format: n° - Speaker - Utterance (tabular version)
Text format: Start time - Speaker - Utterance (tabular version)
Text format: Start and end time - Speaker - Utterance (tabular version)
Text format: Start and end time - Speaker - Utterance (tabular version) + Header (for orthographic checking)
Overlapping format: Speaker - Utterance
Overlapping format: Speaker - Turn
Format per block: Speaker - Time - Utterance
Format per line: n° - Speaker - Utterance - Time
Text without mark
Or click here to select a file =>
Ask for parameters for Praat files. Choice of relation for
Results (Remove)
The TEI_CORPO format follows the proposals from the TEI Spoken ISO.
It fully conforms to the TEI standard.
A Java program for batch processing and additional functions
can be downloaded here.
More information is available here.
The Excel export option "one line per utterance" can be used to play the linked video directly from Excel.
To do this, first export to Excel format, open the file, then copy and paste the data into the downloadable Excel template
found here.
Warning: VLC must be installed. Download VLC.
Videos can be started on macOS with Alt+Cmd+W and on a PC with Shift+Ctrl+W.
These shortcuts can be changed by editing the Excel macros.