A Praat plug-in that tokenizes an interval tier into a word, syllable, or segment tier

rolandomunoz/plugin_tokenizer

Tokenizer

This is a Praat plug-in that splits TextGrid annotations into segments, syllables, or words. The resulting segmentation is stored as a new tier within the same TextGrid.
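To make the idea concrete, here is a minimal Python sketch of word-level tokenization of an interval tier. It is not the plug-in's code: the `Interval` type, the function names, and the dictionary-of-tiers layout are all hypothetical, and the equal split of each interval's duration among its words is an assumption made only for illustration. This README does not describe how the plug-in actually places boundaries.

```python
from typing import Dict, List, NamedTuple


class Interval(NamedTuple):
    xmin: float  # start time in seconds
    xmax: float  # end time in seconds
    text: str    # interval label


def tokenize_words(interval: Interval) -> List[Interval]:
    """Split one labelled interval into one interval per whitespace-separated word.

    Assumption (illustration only): the duration is shared equally among the words.
    """
    words = interval.text.split()
    if not words:
        return [interval]  # empty intervals are kept unchanged
    step = (interval.xmax - interval.xmin) / len(words)
    tokens = []
    for i, word in enumerate(words):
        start = interval.xmin + i * step
        # Pin the final boundary to xmax so rounding cannot move it.
        end = interval.xmax if i == len(words) - 1 else interval.xmin + (i + 1) * step
        tokens.append(Interval(start, end, word))
    return tokens


def add_word_tier(tiers: Dict[str, List[Interval]], input_tier: str,
                  output_tier: str = "word") -> Dict[str, List[Interval]]:
    """Return a copy of `tiers` with a new tier built from `input_tier`."""
    result = dict(tiers)  # the original tiers are left untouched
    result[output_tier] = [
        token for interval in tiers[input_tier] for token in tokenize_words(interval)
    ]
    return result
```

For instance, `add_word_tier({"phrase": [Interval(0.0, 1.5, "el perro ladra")]}, "phrase")` keeps the `phrase` tier and adds a `word` tier with three intervals, mirroring how the plug-in stores its output next to the input tier.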

Documentation

Getting started

Prerequisites

  • Praat 6.0.40 or later (download here)

Install

Set-up

In the Object window, go to Praat > Tokenize > Settings... A dialog box will appear.

How to use it?

Once you are done with the settings, you are ready to process your own TextGrids. Before starting, remember that you need to provide an interval tier as an input for this plug-in to work. Here is an example.

With this plug-in, you can segment TextGrids that are selected in the Object window as well as TextGrids stored in a folder.

From the Object window...

First, select those TextGrids that you want to segment. Then, go to TextGrid: Add tokenized tier...

When you click on it, you will see a dialog box. In Input tier, write the name of the tier where your annotations are stored. Then, check the segmentation levels to be generated. Finally, press Apply or OK. The TextGrids are now segmented!

From a folder...

In the Praat menu, go to Praat > Tokenizer > Tokenize(do all)... You will see a dialog box similar to the one shown in the previous case. In Folder with annotation files, enter the directory where your TextGrids are located on your machine. In Save results in, enter the path where the resulting files will be stored. Then, complete the other fields as explained before and press Apply or OK. The resulting files should be in the destination directory.
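Before running a batch, it can help to check which files the input folder contains and to make sure the output folder exists. The following Python sketch is one way to do that outside Praat. It is not part of the plug-in: the function names are hypothetical, and it assumes annotation files use the `.TextGrid` extension and sit directly inside the input folder.

```python
from pathlib import Path
from typing import List


def list_textgrids(folder: str) -> List[Path]:
    """Return the TextGrid files directly inside `folder`, sorted by path.

    Assumption: annotation files end in .TextGrid (matched case-insensitively).
    """
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() == ".textgrid"
    )


def prepare_output_folder(folder: str) -> Path:
    """Create the output folder (and any missing parents) if it does not exist."""
    out = Path(folder)
    out.mkdir(parents=True, exist_ok=True)
    return out
```

The two paths passed to these helpers would be the same ones entered in the dialog's two folder fields.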

Author

  • Rolando Muñoz Aramburú

License

This project is licensed under the GNU GPL terms - see the LICENSE.md file for details.

How to cite?

Muñoz A., Rolando (2018). Tokenizer [Praat plug-in]. Version 1.2.0, retrieved 20 Sep 2018 from https://gitlab.com/praat_plugins_rma/plugin_tokenizer
