tindex manpage

COMMAND

tindex

SYNOPSIS

tindex [-E] [-f string] [-a] [-b] file(s)/directory(ies) [ > output ]

OPTIONS

ⓘ	`-a`	Include all pitch and rhythm features in output index data (equivalent to `-pr`).
	`-b`	Include bibliographic (reference) records in output index data.
	`-B string`	Filter bibliographic (reference) records according to the list given in string.
	`-d dir`	Append directory name to filename tags. Removes any existing directory prefix.
	`-D`	Suppress directory prefix in filename tags.
ⓘ	`-E`	Suppress extra fields (key and meter descriptions)
ⓘ	`-f string`	Extract particular features using a list of tags for the features to extract.
ⓘ	`-G`	Do not index grace notes.
ⓘ	`-i`	Store the instrument name within the initial tag.
	`-l #`	Limit the number of notes processed for the entry to first # of the file.
	`-r`	Include all rhythmic fields in output. If used without `-p`, then it will suppress the extraction of pitch features. The `-a` option will also include all rhythm features in the extracted feature index.
	`-p`	Include all pitch features in output (default behavior).
ⓘ	`-Q`	Turn off quiet mode. Don't suppress control messages which start with '`#`' and describe important option settings for the themax and theloc commands.
	`-t file`	Substitute the filename in the first column of output data with a different string (typically a control number for database management).
ⓘ	`--end`	For chords (multi-stops), encode the last note in the token rather than the first note.
ⓘ	`--fermata`	Encode fermatas as segmentation boundaries.
ⓘ	`--phrase`	Encode phrase endings as segmentation boundaries.
ⓘ	`--moly`	Extract multiple monophonic entries from a polyphonic data file. Only the first sub-spine layer in each spines will be parsed. The default behavior of tindex it to extract data from all layers of all **kern data spines. [ore info]
ⓘ	`--rest`	Encode rests as segmentation boundaries.

DESCRIPTION

tindex

themax

tindex

Themefinder

The tindex program is an expanded version of the original themebuilder command written in AWK by David Huron, utilizing Humdrum Toolkit commands to extract pitch features from monophonic **kern data. The tindex program can emulate the original version by using the --mono option.

The tindex program allows indexing of polyphonic music consisting of strictly monophonic voices (molyphonic) as well as more complicated polyphony characteristic of keyboard music where voices enter and drop out from the overall texture of the music. The tindex/themax program pair can only handle monophonic sequences, so the tindex program can either choose the first or last note listed in a chord (multi-stop token). Other added capabilities include rhythmic feature extraction, reference (bibliographic) record extraction, selective feature extraction, grace note inclusion/exclusion, and segmentation boundary encoding.

As a basic example, consider the following six Humdrum files:

ex1.krn
**kern *clefG2 *k[f#] *G: *M4/4 *met(C) =1- 4.dd 8cc 4b 4g =2 4f# 4a 4g 4b =3 4a 4cc 4b 4dd =4 4d 4f# 4g 4g == *- ex2.krn
**kern *clefG2 *k[] *C: *M4/4 *met(C) =1- (8cc 8r 8g 8r 8e 8r 8r 8a =2 8d 8r 8r 8g 8c) 8r 8r (8ee =3 8f 8r 8r 8dd 8e 8r 8r 8cc =4 8dL 8a 8A 8BJ 8dL 8cJ 4r == *- ex3.krn
**kern *clefG2 *k[f#c#g#] *f#: *M4/4 (8bL 8ddJ =1 4.cc# 8b 8aL 8g#J 8aL 8cc#J =2 4b 4a 4g#) (8g#L 8aJ =3 8bL 8cc#J 4dd 4cc# 8bL 8cc#J =4 2a 2g# =5 2a) 4r == *- ex4.krn
**kern *clefG2 *k[b-e-a-] *E-: *M2/4 =1- [2e- =2 4e-] 8fL 8gJ =3 [2a- =4 4a-] 8b-L 8a-J =5 4g 8a-L 8gJ =6 4f 8e-L 8fJ =7 [2g =8 4g] 4r == *- ex5.krn
**kern *clefF4 *k[f#c#g#d#] *E: *M2/8 =1- ([4E =2 8EL] 16F#L 16EJJ =3 [4D# =4 8D#]L 16EL 16D#JJ =5 16C#LL 16BB 16C# 16D#JJ =6 16ELL 16F# 16G# 16F#JJ =7 8EL 8D#J =8 8E) 8r == *- ex6.krn
**kern *clefC3 *k[b-] *F: *M2/4 =1- (8F'L 8G' 8A' 8B-'J =2 8c'L 8A'J 4F) =3 (4f 8eL 8dJ =4 2c) =5 (8F'L 8G' 8A' 8B-'J =6 8c'L 8A'J 4F) =7 4c^ 4D^ =8 2F; == *-

Passing these files to tindex will generate a search entry line for each file:

tindex ex?.krn > index.thema

ex1.krn::1	ZG=	{m2m1m4m1p3m2p4m2p3m1p3m12p4p1p0	#ddDdUdUdUdUDUus	:DDDDUDUDUDUDUUS	%5431721324355711	}xM2xm2xM3xm2Xm3xM2XM3xM2Xm3xm2Xm3xP8XM3Xm2P1	j20B7697B90B22677	JD C B G F# A G B A C B D D F# G G 	M4/4quadruplesimple
ex2.krn::1	ZC=	{m5m3p5m7p5m7p16m11p9m10p8m10p7m12p2p3m2	#DDUDUDUDUDUDUDuUd	:DDUDUDUDUDUDUDUUD	%153625134231266721	}xP4xm3XP4xP5XP4xP5XM10xM7XM6xm7Xm6xm7XP5xP8XM2Xm3xM2	j074927045240299B20	JC G E A D G C E F D E C D A A B D C 	M4/4quadruplesimple
ex3.krn::1	zF#=	{p3m1m2m2m1p1p4m2m2m1p0p1p2p2p1m1m2p2m4m1p1	#UdddduUdddsuuuudduDdu	:UDDDDUUDDDSUUUUDDUDDU	%4654323543223456545323	}Xm3xm2xM2xM2xm2Xm2XM3xM2xM2xm2P1Xm2XM2XM2Xm2xm2xM2XM2xM3xm2Xm2	jB21B9891B9889B121B1989	JB D C# B A G# A C# B A G# G# A B C# D C# B C# A G# A 	M4/4quadruplesimple
ex4.krn::1	ZE-=	{p2p2p1p2m2m1p1m1m2m2p2p2	#uuuudduddduu	:UUUUDDUDDDUU	%1234543432123	}XM2XM2Xm2XM2xM2xm2Xm2xm2xM2xM2XM2XM2	j3578A87875357	JEb F G Ab Bb Ab G Ab G F Eb F G 	M2/4duplesimple
ex5.krn::1	ZE=	{p2m2m1p1m1m2m2p2p2p1p2p2m2m2m1p1	#udduddduuuuudddu	:UDDUDDDUUUUUDDDU	%12171765671232171	}XM2xM2xm2Xm2xm2xM2xM2XM2XM2Xm2XM2XM2xM2xM2xm2Xm2	j4643431B134686434	JE F# E D# E D# C# B C# D# E F# G# F# E D# E 	M2/8duplesimple
ex6.krn::1	ZF=	{p2p2p1p2m3m4p12m1m2m2m7p2p2p1p2m3m4p7m10p3	#uuuuDDUdddDuuuuDDUDU	:UUUUDDUDDDDUUUUDDUDU	%123453117651234531561	}XM2XM2Xm2XM2xm3xM3XP8xm2xM2xM2xP5XM2XM2Xm2XM2xm3xM3XP5xm7Xm3	j579A0955420579A095025	JF G A Bb C A F F E D C F G A Bb C A F C D F 	M2/4duplesimple

Application of the search index

themax

themax

tindex

ex4.krn

themax -p "e- f g" index.thema

If you want to perform an "AND" search with another independent musical feature, then the output from themax can be piped into another call to the program with the matches from the first search. To search for features in parallel (such as pitch and rhythm at the same time), the search queries are given as multiple options to a single call to themax. For example, the "e-flat, f, g" sequence occurs with the durations "dotted-half, eighth, eighth" which can be queried by the -u option:

themax -p "e- f g" -u "2. 8 8" index.thema

An additional program called theloc (thema location) can then be used to identify the location in the original file when the --location option is given to themax. In this case "=1B1" means the matched sequence occurs starting at measure 1, beat 1 in the original data file (ex4.krn):

themax -p "e- f g" -u "2. 8 8" index.thema --location | theloc -N

An additional output option from the theloc program will also mark the individual notes which caused the match. In the following search, the "e-flat, f, g" sequence is searched without considering the rhythm, and there are two locations in the file where the query is found. The --ending option has to be supplied along with the --location option so that both the starting and ending notes of the matches can be highlighted in the output data (otherwise, only the first note at the start of the match will be marked).

themax -p "e- f g" index.thema --location --ending | theloc --mark

Each matched note is marked with an "@" character, and an !!!RDF: record explaining that character's meaning (a matched note) is given at the bottom of the file. The example on the right is generated by adding the --tie option to the theloc program so that it will highlight all tied notes after the first note in a group of notes tied together. The marking character can be used to locate matches in a text editor (by searching for "@" in the resulting file, or the marking character can be used to highlight the note in graphical music notation, such as coloring the matched notes red:

Thema index pitch fields

Each line contains multiple fields separated by a tab character, and each field except the first one at the start of the line begins with a unique tag character to facilitate searches in the thema command. The ten tab-separated default entries on each line are:

An identification string which, by default, contains the name of the original file followed by two colons (which may be split by an instrument tag), and then the spine number in the original file from which the extracted features occur.
key -- starting with uppercase Z for major modes or lowercase z for minor modes, then the tonic note of the key (in uppercase), terminated by an equals sign (=). Example: ZG= which represents G major. Note that there must be a key designation record in the file in order for the key to be extracted into the index, and only the first key designation in the spine will be encoded in the index.
twelve-tone interval -- starting with an open curly brace ({), then a string of intervals without spaces, using m (minus) for falling melodic intervals, p (positive) for rising intervals (p is also used before repeated notes). Example: {m2m1m4m1p3m2p4m2p3m1p3m12p4p1p0
pitch refined contour -- starting with # and followed by five possible characters: d = down a diatonic step, D = down a diatonic leap (greater than an interval of a 2nd), s = same pitch (repeated note), u = up a step, U = up a leap. Example: #ddDdUdUdUdUDUus
pitch gross contour -- a three-level description of the melodic contour (as opposed to 5 for refined contour). The data field starts with a colon (:), and then has three possible characters: U = up (next note is a higher pitch than the current note), S = same or repeated note, D = down. Example: :DDDDUDUDUDUDUUS
scale degree -- starts with a percent sign (%) followed by the numbers 1 through 7 to indicate the seven diatonic steps of a major or minor scale. Accidentals are ignored, so both C and C# in C major are labeled as 1. Example: %5431721324355711 Note that there must be a key designation record in the file in order for the scale degrees to be extracted from the data.
musical interval -- abbreviations of the standard names for musical intervals. This field starts with a right curly brace (}), followed by a sequence of intervals without spaces which consist of three parts: (1) the interval direction (x for down, X for up), (2) the quality of the diatonic interval (M=major, m=minor, P=perfect, A=augmented, d=diminished, and (3) the diatonic distance as a number, such as 3 for a third. Example: }xM2xm2xM3xm2Xm3xM2XM3xM2Xm3xm2Xm3xP8XM3Xm2P1
twelve-tone pitch class. Starting with a j, then followed by the diatonic pitch classes, starting with C = 0, C-sharp/D-flat = 1, D = 2. For two digit pitch classes, letters of the alphabet are substituted: A-sharp/B-flat = 10 -> A, and B/C-flat = 11 -> B. Example: j20B7697B90B22677
diatonic pitch class -- Starting with J and followed by the pitch class names. This field is the only one which separates individual notes by spaces. Diatonic pitch names are in upper case (A through G) followed by an accidentals: # for sharps/double sharps, and - for flats/double flats. Example: JD C B G F# A G B A C B D D F# G G
metric description -- starting with an M, followed by the numeric values for the time signature, and then followed by quadruple, triple, etc which describes the type of metric cycle, followed by simple or compound depending on if the top number in the time signature is divisible by 3. Example: M4/4quadruplesimple

Rhythmic analysis option

Using the -r extracts eight rhythmic features into the output search index. When the -r option is used alone, the pitch features are suppressed. To include both pitch and rhythm features use the option pair -p -r or -a (for all) to include all musical features.

duration gross contour (~)
duration refined contour (^)
duration (as an inter-onset-interval) (!)
beat level (&)
metric level (`)
metric refined contour (')
metric gross contour (@)
beat position (=)

Listed below are example rhythmic features extracted from the six melodies given above. The first example extracted only the rhythmic features with the -r option, while the second example extracts all musical features with the -a option (both pitch and rhythm features).

tindex -r ex?.krn > index.thema

ex1.krn::1	ZG=	M4/4quadruplesimple	~<>=============	^[>=============	;4d 8 4 4 4 4 4 4 4 4 4 4 4 4 4 4 	&1011111111111111	'p2 m1 p1 0 p2 0 p1 0 p2 0 p1 0 p2 0 p1 0 	`WHwHWhwHWhwHWhw	@WHWHWHWHWHWHWHW	=x1 x2_1/2 x3 x4 x1 x2 x3 x4 x1 x2 x3 x4 x1 x2 x3 x4 
ex2.krn::1	ZC=	M4/4quadruplesimple	~=================	^=================	;8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 	&111010101010101010	'p2 0 p1 m1 p2 m1 p1 m1 p2 m1 p1 m1 p2 m1 0 m1 p1 m1 	`WhWHWHWHWHWHWhwHW	@WHWHWHWHWHWHWHWHW	=x1 x2 x3 x4_1/2 x1 x2_1/2 x3 x4_1/2 x1 x2_1/2 x3 x4_1/2 x1 x1_1/2 x2 x2_1/2 x3 x3_1/2 
ex3.krn::1	zF#=	M4/4quadruplesimple	~=><====>==<===>=<=>==	^=][====>==<===>=<=]==	;8 8 4d 8 8 8 8 8 4 4 4 8 8 8 8 4 4 8 8 2 2 2 	&1010101011110101110111	'0 m1 p2 m1 p1 m1 0 m1 p2 0 p1 0 m1 p2 m1 0 p1 0 m1 p2 p1 p2 	`wHWHWhwHWhwwHWhhwwHwh	@WHWHWHWHWHWWHWHHWWHWH	=x4 x4_1/2 x1 x2_1/2 x3 x3_1/2 x4 x4_1/2 x1 x2 x3 x4 x4_1/2 x1 x1_1/2 x2 x3 x4 x4_1/2 x1 x3 x1 
ex4.krn::1	ZE-=	M2/4duplesimple	~<=><=><=><=>	^[=][=><=><=]	;2d 8 8 2d 8 8 4 8 8 4 8 8 2d 	&1101101101101	'p1 0 m1 p1 0 m1 p1 0 m1 p1 0 m1 p1 	`wwHwwHwwHwwH	@WWHWWHWWHWWH	=x1 x2 x2_1/2 x1 x2 x2_1/2 x1 x2 x2_1/2 x1 x2 x2_1/2 x1 
ex5.krn::1	ZE=	M2/8duplesimple	~<=><=========>==	^[=][=========>==	;4d 16 16 4d 16 16 16 16 16 16 16 16 16 16 8 8 8 	&11011010101010111	'0 0 m1 0 0 m1 0 m1 0 m1 0 m1 0 m1 0 0 0 	`SwhSwhwhwhwhwhSS	@SWHSWHWHWHWHWHSS	=x1 x2 x2_1/2 x1 x2 x2_1/2 x1 x1_1/2 x2 x2_1/2 x1 x1_1/2 x2 x2_1/2 x1 x2 x1 
ex6.krn::1	ZF=	M2/4duplesimple	~=====>=<=><=====>==>	^=====>=<=][=====>==>	;8 8 8 8 8 8 4 4 8 8 2 8 8 8 8 8 8 4 4 4 2 	&101010111011010101111	'p1 m1 0 m1 p1 m1 0 p1 0 m1 p1 p1 m1 0 m1 p1 m1 0 p1 0 p1 	`WhwHWhhwwHSWhwHWhhwh	@WHWHWHHWWHSWHWHWHHWH	=x1 x1_1/2 x2 x2_1/2 x1 x1_1/2 x2 x1 x2 x2_1/2 x1 x1 x1_1/2 x2 x2_1/2 x1 x1_1/2 x2 x1 x2 x1

tindex -a ex?.krn > index.thema

ex1.krn::1	ZG=	{m2m1m4m1p3m2p4m2p3m1p3m12p4p1p0	#ddDdUdUdUdUDUus	:DDDDUDUDUDUDUUS	%5431721324355711	}xM2xm2xM3xm2Xm3xM2XM3xM2Xm3xm2Xm3xP8XM3Xm2P1	j20B7697B90B22677	JD C B G F# A G B A C B D D F# G G 	M4/4quadruplesimple	~<>=============	^[>=============	;4d 8 4 4 4 4 4 4 4 4 4 4 4 4 4 4 	&1011111111111111	'p2 m1 p1 0 p2 0 p1 0 p2 0 p1 0 p2 0 p1 0 	`WHwHWhwHWhwHWhw	@WHWHWHWHWHWHWHW	=x1 x2_1/2 x3 x4 x1 x2 x3 x4 x1 x2 x3 x4 x1 x2 x3 x4 
ex2.krn::1	ZC=	{m5m3p5m7p5m7p16m11p9m10p8m10p7m12p2p3m2	#DDUDUDUDUDUDUDuUd	:DDUDUDUDUDUDUDUUD	%153625134231266721	}xP4xm3XP4xP5XP4xP5XM10xM7XM6xm7Xm6xm7XP5xP8XM2Xm3xM2	j074927045240299B20	JC G E A D G C E F D E C D A A B D C 	M4/4quadruplesimple	~=================	^=================	;8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 	&111010101010101010	'p2 0 p1 m1 p2 m1 p1 m1 p2 m1 p1 m1 p2 m1 0 m1 p1 m1 	`WhWHWHWHWHWHWhwHW	@WHWHWHWHWHWHWHWHW	=x1 x2 x3 x4_1/2 x1 x2_1/2 x3 x4_1/2 x1 x2_1/2 x3 x4_1/2 x1 x1_1/2 x2 x2_1/2 x3 x3_1/2 
ex3.krn::1	zF#=	{p3m1m2m2m1p1p4m2m2m1p0p1p2p2p1m1m2p2m4m1p1	#UdddduUdddsuuuudduDdu	:UDDDDUUDDDSUUUUDDUDDU	%4654323543223456545323	}Xm3xm2xM2xM2xm2Xm2XM3xM2xM2xm2P1Xm2XM2XM2Xm2xm2xM2XM2xM3xm2Xm2	jB21B9891B9889B121B1989	JB D C# B A G# A C# B A G# G# A B C# D C# B C# A G# A 	M4/4quadruplesimple	~=><====>==<===>=<=>==	^=][====>==<===>=<=]==	;8 8 4d 8 8 8 8 8 4 4 4 8 8 8 8 4 4 8 8 2 2 2 	&1010101011110101110111	'0 m1 p2 m1 p1 m1 0 m1 p2 0 p1 0 m1 p2 m1 0 p1 0 m1 p2 p1 p2 	`wHWHWhwHWhwwHWhhwwHwh	@WHWHWHWHWHWWHWHHWWHWH	=x4 x4_1/2 x1 x2_1/2 x3 x3_1/2 x4 x4_1/2 x1 x2 x3 x4 x4_1/2 x1 x1_1/2 x2 x3 x4 x4_1/2 x1 x3 x1 
ex4.krn::1	ZE-=	{p2p2p1p2m2m1p1m1m2m2p2p2	#uuuudduddduu	:UUUUDDUDDDUU	%1234543432123	}XM2XM2Xm2XM2xM2xm2Xm2xm2xM2xM2XM2XM2	j3578A87875357	JEb F G Ab Bb Ab G Ab G F Eb F G 	M2/4duplesimple	~<=><=><=><=>	^[=][=><=><=]	;2d 8 8 2d 8 8 4 8 8 4 8 8 2d 	&1101101101101	'p1 0 m1 p1 0 m1 p1 0 m1 p1 0 m1 p1 	`wwHwwHwwHwwH	@WWHWWHWWHWWH	=x1 x2 x2_1/2 x1 x2 x2_1/2 x1 x2 x2_1/2 x1 x2 x2_1/2 x1 
ex5.krn::1	ZE=	{p2m2m1p1m1m2m2p2p2p1p2p2m2m2m1p1	#udduddduuuuudddu	:UDDUDDDUUUUUDDDU	%12171765671232171	}XM2xM2xm2Xm2xm2xM2xM2XM2XM2Xm2XM2XM2xM2xM2xm2Xm2	j4643431B134686434	JE F# E D# E D# C# B C# D# E F# G# F# E D# E 	M2/8duplesimple	~<=><=========>==	^[=][=========>==	;4d 16 16 4d 16 16 16 16 16 16 16 16 16 16 8 8 8 	&11011010101010111	'0 0 m1 0 0 m1 0 m1 0 m1 0 m1 0 m1 0 0 0 	`SwhSwhwhwhwhwhSS	@SWHSWHWHWHWHWHSS	=x1 x2 x2_1/2 x1 x2 x2_1/2 x1 x1_1/2 x2 x2_1/2 x1 x1_1/2 x2 x2_1/2 x1 x2 x1 
ex6.krn::1	ZF=	{p2p2p1p2m3m4p12m1m2m2m7p2p2p1p2m3m4p7m10p3	#uuuuDDUdddDuuuuDDUDU	:UUUUDDUDDDDUUUUDDUDU	%123453117651234531561	}XM2XM2Xm2XM2xm3xM3XP8xm2xM2xM2xP5XM2XM2Xm2XM2xm3xM3XP5xm7Xm3	j579A0955420579A095025	JF G A Bb C A F F E D C F G A Bb C A F C D F 	M2/4duplesimple	~=====>=<=><=====>==>	^=====>=<=][=====>==>	;8 8 8 8 8 8 4 4 8 8 2 8 8 8 8 8 8 4 4 4 2 	&101010111011010101111	'p1 m1 0 m1 p1 m1 0 p1 0 m1 p1 p1 m1 0 m1 p1 m1 0 p1 0 p1 	`WhwHWhhwwHSWhwHWhhwh	@WHWHWHHWWHSWHWHWHHWH	=x1 x1_1/2 x2 x2_1/2 x1 x1_1/2 x2 x1 x2 x2_1/2 x1 x1 x1_1/2 x2 x2_1/2 x1 x1_1/2 x2 x1 x2 x1

Selective feature indexing

tindex

themebuilder

-r

-a

However, if you only want a specific subset of any of the extractable features, use the -f option followed by a list of the features to extract according to the feature tags in the following table. This option is useful when only specific musical features will be searched. In these case, index file size will be minimized and search processing time will be increased by only including the desired musical features.

Feature tag	Feature prefix	Themax option	Description
PCH, P, PC	`J`	`-p`	Diatonic Pitch Class: C, C#, D-, D, E--, F#, F##, G, etc.
MI, DI, INT, I	`}`	`-I`	Diatonic Interval: +P5, -3, P1, +P8, etc.
SD, S, D	`%`	`-d`	Diatonic Scale Degree: 1, 2, 3, 4, 5, 6, 7.
12P	`j`	`-P`	Twelve-tone Pitch Class: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, A, B.
12I	`{`	`-i`	Twelve-tone Interval: 0, -1, +2, +5, -7, etc.
PGC, GC, CON	`:`	`-C`	Pitch Gross Contour: U, D, S.
PRC, RC	`#`	`-c`	Pitch Refined Contour: U, u, D, d, S.
DUR, IOI	`;`	`-u`	Duration: 8, 4, 16, 8., 12, etc.
DGC, RGC	`~`	`-R`	Duration Gross Contour:
DRC, RRC	`^`	`-r`	Duration Refined Contour:
BLV	`&`	`-b`	Beat Level
MLV	`'`	`-L`	Metric Level
MPS	`=`	`-l`	Metric Position
MRC	`	`-e`	Metric Refined Contour
MGC	`@`	`-E`	Metric Gross Contour

When a particular feature has more than one tag, any of those tags are aliases for the same musical feature. For example, to extract only the diatonic pitch class feature, use the option -f "PCH", or equivalently: -f "P", -f "PC".

tindex -f "PCH" ex?.krn > output.thema

The musical key and meter features can be suppressed with the -E option (meaning: no extra features).

tindex -E -f "PCH" ex?.krn > output.thema

Multiple selected features can be extracted by adding them to the -f option string, separated by one or more space characters. The ordering of the features in the option string does not matter: all features will be output in the canonical order required for searching with the themax command. In the following example, the pitch and duration features are extracted. Even though the features are listed in the order duration/pitch, the output index is ordered pitch/duration.

tindex -E -f "DUR PCH" ex?.krn > output.thema

Including segmentation markers

R

control messages

-Q

Rests as segmentation boundaries

The --rest option will cause "R" segmentation markers to be placed within all extracted pitch and rhythm features whenever pitch sequences are separated by one or more rests. Only one rest marker will be inserted between two pitch/rhythm features, even if there are multiple intervening rests.

For pitch/rhythm interval features, if a single pitch is surrounded by rests both before and after the note, there will be two "R" markers in a row in the extracted features data (see example below). This is used to keep track of the number of notes in the original musical data for later alignment of the search matches in the musical data.

input
**kern *clefF4 *k[] *C: *M3/4 16C\LL 16D\ 16E\ 16F\JJ = 8G\ 8r 8G\ 8r 16C\LL 16D\ 16E\ 16F\JJ = *- tindex -a input > output
rest.krn ZC= {p2p2p1p2p0m7p2p2p1 #uuuusDuuu :UUUUSDUUU %1234551234 }XM2XM2Xm2XM2P1xP5XM2XM2Xm2 j0245770245 JC D E F G G C D E F M3/4triplesimple ~===>=<=== ^===>=<=== ;16 16 16 16 8 8 16 16 16 16 &0000100000 'm1 m3 m2 m3 p1 m1 m1 m3 m2 m3 `WhwHWSWhw @WHWHWSWHW =x3 x3_1/4 x3_1/2 x3_3/4 x1 x2 x3 x3_1/4 x3_1/2 x3_3/4 tindex -a --rest input > output
rest.krn ZC= {p2p2p1p2RRp2p2p1 #uuuuRRuuu :UUUURRUUU %12345R5R1234 }XM2XM2Xm2XM2RRXM2XM2Xm2 j02457R7R0245 JC D E F G R G R C D E F M3/4triplesimple ~===>RR=== ^===>RR=== ;16 16 16 16 8 R 8 R 16 16 16 16 &00001R0R0000 'm1 m3 m2 m3 p1 R m1 R m1 m3 m2 m3 `WhwHRRWhw @WHWHRRWHW =x3 x3_1/4 x3_1/2 x3_3/4 x1 R x2 R x3 x3_1/4 x3_1/2 x3_3/4

Fermatas as segmentation boundaries

--fermata

R

--rest

R

--fermata

--rest

R

input
**kern 4c 4e; 8r 8f 4g; 2a *- no segmentation
fermata.krn ZC= {p4p1p2p2 #Uuuu :UUUU %13456 }XM3Xm2XM2XM2 j04579 JC E F G A M --fermata only
#FERMATA fermata.krn ZC= {p4Rp2R #URuR :URUR %13R45R6 }XM3RXM2R j04R57R9 JC E R F G R A M --fermata & --rest
#REST #FERMATA fermata.krn ZC= {p4Rp2R #URuR :URUR %13R45R6 }XM3RXM2R j04R57R9 JC E R F G R A M --rest only
#REST fermata.krn ZC= {p4Rp2p2 #URuu :URUU %13R456 }XM3RXM2XM2 j04R579 JC E R F G A M

Phrase endings as segmentation boundaries

Phrase endings (}) can also be used to mark segmentation boundaries with the R character in the output feature index data. Phrase endings can fall on rests.

input
**kern {4c 4d 4e} {4f 4g} 8r {4a 4b 4c} *- no segmentation
phrase.krn ZC= {p2p2p1p2p2p2m11 #uuuuuuD :UUUUUUD %12345671 }XM2XM2Xm2XM2XM2XM2xM7 j024579B0 JC D E F G A B C M --phrase only
#PHRASE phrase.krn ZC= {p2p2Rp2Rp2m11 #uuRuRuD :UURURUD %123R45R671R }XM2XM2RXM2RXM2xM7 j024R57R9B0R JC D E R F G R A B C R M --phrase & --rest
#REST #PHRASE phrase.krn ZC= {p2p2Rp2Rp2m11 #uuRuRuD :UURURUD %123R45R671R }XM2XM2RXM2RXM2xM7 j024R57R9B0R JC D E R F G R A B C R M --rest only
#REST phrase.krn ZC= {p2p2p1p2Rp2m11 #uuuuRuD :UUUURUD %12345R671 }XM2XM2Xm2XM2RXM2xM7 j02457R9B0 JC D E F G R A B C M

Grace notes

-G

-Q

#NOGRACE

control message

theloc

-Q

themax

theloc

input
**kern 4c 4d 16gq 4e 4f 4g *- tindex input > output
grace.krn ZC= {p2p5m3p1p2 #uUDuu :UUDUU %125345 }XM2XP4xm3Xm2XM2 j027457 JC D G E F G M adding the -G option
#NOGRACE grace.krn ZC= {p2p2p1p2 #uuuu :UUUU %12345 }XM2XM2Xm2XM2 j02457 JC D E F G M

Polyphonic option

Polyphonic data extraction can be done by using the --poly option. This option extracts multiple entries for a file, with one line for each **kern spine in the file.

Only the first layer of a spine is used for building an index. For example, here is a file with two spines of **kern data:

poly.krn

Running the command "tindex --poly poly.krn" will generate two entries:

Each entry adds a double colon (::) after the filename (or text string substitution when using the -t option), followed by the spine number from which the indexing data was extracted. Note that the second column of data in the second spine is currently ignored.

Fully polyphonic melodic extraction

--poly2

--poly

--poly2

poly.krn

tindex --poly2 poly.krn -E -f "P"

Notice that with the --poly2 option, an additional line is added to the output index data. The third line in the above index data represents the pitch sequence found in the second subs-pine (i.e., the second layer) of the second spine. All secondary subs-pine data is indicated in the voice number after the filename after a period character after the primary spine number. In the above example "2.2" means that the sequence is from the second spine in the file (first 2), and in the second sub-spine in spine 2 (second 2).

When secondary subs-pines are not contiguous, a segmentation marker will be added to the output data.

poly2.krn

tindex --poly2 poly2.krn -EfP

In the above example, the pitch sequence "ff gg" in the second subs-pine of the second spine is not immediately followed by the pitch sequence "bb ccc" later in the sub-spine. Therefore, a segmentation marker (R) is added between these two sub-sequences. In addition, since the second sub-spine (second layer) of the second spine does not start at the beginning of the music, a segmentation marker starts the index data for the second layer of the second spine.

Instrument label

In order to allow searching by instrument, the -i option can be given to store an instrumental name within the initial tag field of an index entry. Instrument names are give in Humdrum **kern data with a tandem interpretation starting with the characters *I:. Currently the instrument label will only work when the --poly or --poly2 option is also given.

instrument.krn

Running the command "tindex -i --poly instrument.krn" will generate two entries which include the instrument label:

Including bibliographic records

When the -b option is given to tindex, all bibliographic (reference) records found in the input Humdrum file will be appended to the end of the feature list on an output index line. All bibliographic records will be placed in sorted ASCII order which is required for searching in multiple records using themax. Each bibliographic entry will be separated by a tab character on the output index line.

The -B option can be used to select only particular bibliographic records to store in the output index data. For example, to only store title records (if they are present in the input data), then the option would be -B "OTL". In this case all other bibliographic records, such as COM (composer's name records) will be suppressed.

To allow more than one bibliographic record type in the -B record filter string, each bibliographic key should be separate by spaces, colons, and/or commas. For example, to allow for the composer and title only in the output index, use "-B "COM, OTL" or -B "COM:OTL". The order of the bibliographic keys in the argument string for -B is not important, since the output index data will always produce bibliographic records in sorted ASCII order.

The bibliographic keys within the -B string are actually regular expressions. This allows for more specific filtering rules, such as:

-B "^C" == allow all bibliographic records which start with a capital C, such as COM (composer) and CDT (composer's dates).
-B "^C,^O" == allow all bibliographic records which start with either a C or an O.
-B "^OTL$" == allow bibliographic records which match exactly to OTL, and suppress records such as OTL1, OTL@@FRE (title in the original language of French), or OTL@ENG (title translated into English).

By default -B "OTL" will match to bibliographic keys such as: OTL, OTL1, OTL@@FRE, OTL@ENG since all of them contain the string "OTL". The regular expression anchors for start and end of line (^ and $) are local to each bibliographic key in a -B option string.

Control messages

Command-line settings which can affect the operation of themax and theloc are stored in control messages in the output data if the -Q option is specified. These messages start with a hash sign (#). All of these messages are suppressed in the output if the -Q option is not given. Messages will not contain tab characters on the line, which could interfere with the search mechanism within themax. Here is a list of the messages which may occur:

#REST: The --rest option was given in the command-line call to tindex. This option includes "R" markers in the output data which are used to prevent pitch sequences from crossing rest boundaries. The default behavior of #NOREST will be include in the index if the --rest option has not been used, so that multiple indexes extracted with different option settings can be processed together properly.
#FERMATA: The --fermata option was given in the command-line call to tindex. This option includes "R" markers in the output data which are used in a similar manner to the --rest option to prevent pitch sequences from crossing phrase boundaries. Fermatas are encoded in **kern data as semi-colons (;). The default behavior of #NOFERMATA will be include in the index if the --fermata option has not been used, so that multiple indexes extracted with different option settings can be processed together properly.
#PHRASE: The --phrase option was given in the command-line call to tindex. This option includes "R" markers in the output data which are used in a similar manner to the --rest and --fermata option to prevent pitch sequences from crossing phrase boundaries. Phrases endings are encoded in **kern data as closing curly braces (}). The default behavior of #NOPHRASE will be include in the index if the --phrase option has not been used, so that multiple indexes extracted with different option settings can be processed together properly.
#NOGRACE: The -G option was given in the command-line call to tindex. This option suppresses grace note indexing in the output data. If you use the -G option, this control message is required as input into the theloc command (or the option to specify that grace notes were ignored). The default behavior of #GRACE will be include in the index if the -G option has not been used, so that multiple indexes extracted with different option settings can be processed together properly.
#OVERLAP: The --overlap option in the themax will cause the #OVERLAP message to be printed in its output (currently only in certain cases). A "#NOOVERLAP" may be present in the output from themax to indicate that this option was not used.

Directory processing

tindex

.krn

.thm

Chord processing

The tindex program processes sequences of notes, and therefore it is not useful for searching notes occurring at the same time (see sonority for that). When tindex encounters a chord (or multi-stop) token, it processes only the first note in the token. Typically this note is the lowest note in the chord (although this is not required). If you instead prefer the highest note in the chord, use the --end option to extract the last note in multi-stop tokens.

chord.krn

tindex -E -f "PCH" chord.krn > output.thema

tindex --end -E -f "PCH" chord.krn > output.thema

Note offsets

tindex

theloc

     !noff:17

tindex

For example, the following music contains music in 2/4 and 3/4. Since each entry in a thema index can only indicate a single key/meter, the music can be chopped into two segments, one for each section. The second segment of the music starts with the 7th note of the original music, so add !noff:7 before the first data line in the second segment:

original

1st part

2nd part

When tindex processes the two parts, the note offset value will be stored in the entry for the second segment:

tindex -p twometer[AB].krn

In order to fully link back to the original file, add a global comment to the segmented files which gives the name of the original file:

      !!original-filename: twometer.krn

Then when the index data is created with tindex the original filename will be used instead of the segment's filename:

original

1st part

2nd part

tindex -p twometer[AB]2.krn

Now when themax is used, the correct note numbers will be marked. For example, searching for the pitch sequence "G A" should find two matches—one starting on note 4 and the other starting on note 7 in the original file.

themax -p "ga" --loc twometer2.index

This information can be fed into theloc to mark the matched notes in the original file:

cat twometer.thema | theloc -m

Which can then be converted to highlighted notes in a conversion to graphical music notation:

If you only want to search music selectivly in triple meter, the split data segments make this possible:

EXAMPLES

tindex

tindex examples page

ONLINE DATA

       program file.krn

       program http://www.some-computer.com/some-directory/file.krn

       cat file.krn | program

       echo http://www.some-computer.com/some-directory/file.krn | program

Besides the http:// protocol, there is another special resource indicator prefix called humdrum:// which downloads data from the kernscores website. For example, using the URI humdrum://brandenburg/bwv1046a.krn:

      program humdrum://brandenburg/bwv1046a.krn

http://kern.humdrum.org/cgi-bin/ksdata?file=bwv1046a.krn&l=/brandenburg&format=kern

Musedata Bach Brandenburg Concerto collection.

This online-access of Humdrum data can also interface with the classical Humdrum Toolkit commands by using humcat to download the data from the kernscores website. For example, try the command pipeline:

      humcat humdrum://brandenburg/bwv1046a.krn | census -k

DOWNLOAD

tindex

Linux (i386 processors) (dynamically linked) compiled on 6 Oct 2013.
Windows compiled on 29 Jun 2012.
Mac OS X/i386 compiled on 13 Nov 2013.

The source code for the program was last modified on 7 Apr 2013. Click here to go to the full source-code download page.

tindex manpage

COMMAND

SYNOPSIS

OPTIONS

DESCRIPTION

Application of the search index

Thema index pitch fields

Rhythmic analysis option

Selective feature indexing

Including segmentation markers

Rests as segmentation boundaries

Fermatas as segmentation boundaries

Phrase endings as segmentation boundaries

Grace notes

Polyphonic option

Fully polyphonic melodic extraction

Instrument label

Including bibliographic records

Control messages

Directory processing

Chord processing

Note offsets

EXAMPLES

ONLINE DATA

SEE ALSO

DOWNLOAD