Automatic Generation of Expressive Performance by using Music Structures

Automatic Generation of Expressive Performance
by using Music Structures

Kenzi NOIKE, Nobuo TAKIGUCHI,
Takashi NOSE, Yoshiyuki KOTANI, Hirohiko NISIMURA
Department of Computer Science
Tokyo University of Agriculture and Technology
Abstract

Musical performances played by humans have expressions generated by variation of tempos and dynamics of sounds, different from mechanical performances by musical scores. We claim that this phenomenon is caused by the two facts, that a person can recognize whole musical structures, and that he can interpret melody lines of music. Based on this thought, we design an automatic expressive music performing system by using music structures. We designed an algorithm which automatically analyzes musical structures formalized as binary syntactic trees. Such structures reflect human musical perception. We derive syntactic structures of several music scores using the algorithm, in order to research correspondences between these binary syntactic trees and musical performances played by humans.

1. Introduction
We thought that humans can expressive performance which have variation of tempos and dynamics of sounds, since they can recognize whole musical structures, and they can interpret melody lines of music. Based on this thought, we plan to design a system of automatical expressive performing of music in the same manner as human performance, by using musical structures.
There are the researches of [Ohta, 1987] and [Friberg, 1991] and so on, in the research of this field.
The local metronomic speed in Waltz No.10 in b minor by F. Chopin was analyzed in [Ohta, 1987], mean value and standard deviation in every motives and phrases were calculated and it's motives and phrases were interpreted manually. And also, they calculated autocorrelation function, the way of quantitative analyze of performer's individuality was an open problem as a future theme.
He presented performance rules resulting from a project in which music performance had been analyzed by means of an analysis-by-synthesis procedure, in [Friberg, 1991]. it's were the rules to convert the written score to musically-acceptable performance with pitch, duration, chord symbols, phrase markers and key, and so on. Signs for phrase was edited

manually.
Our research consists of the four stages as follows:

1. to design the algorithm which automatically analyzes musical structures from musical score information.
2. to derive syntactic structures of music by using the algorithm.
3. to construct performance rules from the correspondences between the syntactic structures of music and musical performances played by humans.
4. to generate artificial expressive performance data by applying these rules.

The analysis algorithm given at the stage 1, can extract musical inner structures, it is simple and mechanical.
We have confirmed at the stage 2 that, generated by the algorithm structures of binary syntactic trees reflect human perception of musical structures, and syntactic structures of music are derived by the it.
The stage 3 is now in progress. We have confirmed that there are the correspondences between syntactic structures of music derived by the algorithm and musical performances played by humans. We are doing to construct the rules of its, now.

The data format for performing system is designed at the stage 4.

2. Analysis of the musical structures
2.1 The Structure Analysis Algorithm
We designed an algorithm for analyzing musical structure. It has the new properties to construct better structures. First "pitch ratio" between adjacent notes is used in it. Second, rests are dealt with in it. The algorithm is shown below:

step 1. to get a symbol string, replacing each musical note by a symbol in the manner that same symbols are used for the notes which have the same pitch and duration. Here each rest is dealt as the property of its antecedent note. The note with a rest is replaced by another symbol than any note without a rest.

step 2. to find a pair of symbols, which occurs in the string twice or more times.

step 3. if the pair is found, to replace all its occurrences by a new unique symbol, and to repeat from step 2.

step 4. if there remains no pair in the condition of step 3, then to find a pair of symbols whose occurrences have the same duration pattern and the same pitch ratio. Pitch ratio means the ratio between the pitch of the last note in the first sequence and that of the first in the second that is, difference of semitone unit counts.

step 5. if the pair is found, to replace all its occurrences by a new symbol. If there is a pair which has the same condition and has already been assigned a symbol such as "A" by the step 3, then the new symbol is the old one with a prime symbol such as " A' ".

step 6. if there remains no pair in the condition of step 5, then to restart the process from the step 3, making pairs of the newly obtained symbols.

step 7. if no new symbols are generated, then to stop.

By the this algorithm, binary syntactic trees having syntactic structures of music are generated.

2.2 Evaluation of The Structure Analysis Algorithm
We analyzed syntactic structure of music by applying the structure analysis algorithm to several musical scores.
The result of it was very similar with human perception of musical structures except negligible difference. It can construct better binary syntactic trees from the score of non-stressed start. because the minimum analysis unit of this algorithm is a single note.
Phrasing of melody contain the rests works well, since this algorithm deals with rests.
It can not construct better trees very well, if it is applied to similar melody of pitch transition pattern played by changing rhythm pattern, because its free degree of rhythm analysis low.

3. Analysis of correspondence between structures and performances of music

3.1 Elements analyzed from performances
We analyze piano performances played by humans to construct rules that describe the relationship between the binary syntactic trees and the human performances. We propose a performance model, based on the three following presumptions.

P1. We discriminate right and left hands.
P2. Each notes occupies the interval between its key-on time and its next note's. [Ohta, 1987]
P3. All the keys of notes in a chord are pressed at the same time.

The following parameters define our performance model in these presumptions.
(1) M(i) is the local metronomic speed value of i'th note in a musical score.
(2) A(i) is the ratio parameter between a performance time(a time of key-on to key-off) of i'th note in a musical score and its time in M(i).
(3) V(i) is the dynamics parameter of i'th note.(equal to the velocity value of MIDI.)

One can see tempo change in performances from M(i) and articulations from A(i). The A(i) relates to the musical terms below. We consider it plays an important role to derive the fluent melody lines and beautiful sounds in the musical performance.

A(i) = 1 means legato.

A(i) < 1 means staccato.
A(i) > 1 means legatissimo.

The three parameters were obtained from data disks sold public, called "MUSIC DISK for YAMAHA PIANO PLAYER". It may not modify information of playing time for the reason of difficulties though we guess the maker may modify several mistaken touches. The highest note among the notes that constitute a chord, determines M(i), A(i) and V(i) in a chord because of the two following reasons.

(1) A human player does not press keys of chord notes at the same time in performances usually. Several inspections indicate he presses all the chord notes when he presses the highest note.
(2) The highest note is highlighted among chord notes.

We regard a rest as a part of its antecedent in investigating M(i) and A(i).

3.2 The relationship between syntactic structure and performance
We research the relationship between the syntactic tree sequence generated by the structure analysis algorithm and the results of analyzing performance by the performance model in section 3.1.
We found the two facts as follows:

(1) the same top trees occurred in the tree sequence are perform in very similar manner as each other.
(2) the similarity of the performance of the same subtrees depends upon the equivalence of the paths between the subtrees and the top nodes.

We conclude that we must consider the hierarchy and branching of binary syntactic trees when we construct the rules from relation of between musical structures and performance.
In this field, [Friberg, 1991] rules for musical performance. The rules is not played expressive performance from syntactic structures of music.

4. The Performance System
We have made a performance system which plays a MIDI piano, in accordance with performance data generated by applying the

rules to syntactic structures of music.
We will evaluate the rules which are constructed according to the results in section 3, by listening to the MIDI piano performance this system plays.

5. Conclusion
We designed an automatic expressive music performing system by using syntactic structures of music.
As the first step, we designed an algorithm which automatically analyzes musical structures from musical score information, formalized syntactic structures of music as a sequence of binary syntactic trees. This algorithm derived syntactic structures of music which resembled human perception of musical structures by applying several musical score.
Next, we defined a performance model which has three parameters, and analyzed piano performance played by humans.
We also researched relationship between musical structures and musical performance played by humans from the result data of musical structures and human performance.
We also found out that the difference of performance on the same tree depended upon the location of the hierarchy of the sequence of binary syntactic trees. We are constructing the rules of relation between information getting from binary syntactic trees and expressive performance.
Quantitative description and generalization of the rules for expressively performing from information of hierarchy and branching of binary syntactic trees will be the next themes.

References
[Friberg, 1991] Anders Friberg. Generative Rules for Music Performance: A Formal Description of a Rule System. Computer Music Journal, Vol.15, No.2, pp.56-71, 1991.
[Noike, 1992] Kenzi Noike. The research of relationship between musical structures and tempos, dynamics in the piano performance. Bulletin of JMACS, No.46, pp.19-20, 1992
[Ohta, 1987] Masahisa Ohta and Tomoyasu Taguti. Analysis of agogics in performed piano music through the local metronomic speed. Proceedings of Konan University, pp.53-74, 1987.