![flow chart](https://musicinformatics.gatech.edu/wp-content_nondefault/uploads/2023/07/east-flowchart.png)
Overall pipeline for training a model with pre-trained embeddings as teachers. The training loss is a weighted sum of the prediction loss and the regularization loss (the weighting factor is omitted in the figure). The regularization loss measures the distance between the pre-trained embedding and the output feature map after feature alignment. During inference, only the bottom part with the blue background is used.
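
To make the loss in the caption concrete, here is a minimal sketch in plain Python. Several details are assumptions for illustration and are not taken from the figure: the alignment step is a simple average over time (`align_features`), the distance is cosine distance, the weighting takes the form `(1 - lam) * prediction + lam * regularization`, and the names `training_loss` and `lam` are hypothetical. It also assumes the aligned feature and the teacher embedding have the same dimension.

```python
import math


def align_features(feature_map):
    """Assumed alignment: average a (time x dim) feature map over time."""
    n_frames = len(feature_map)
    dim = len(feature_map[0])
    return [sum(frame[d] for frame in feature_map) / n_frames for d in range(dim)]


def cosine_distance(a, b):
    """Assumed distance measure: 1 minus the cosine similarity of a and b."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def training_loss(prediction_loss, feature_map, teacher_embedding, lam=0.5):
    """Weighted sum of the prediction loss and the regularization loss.

    `lam` is a hypothetical weighting factor; the figure omits the real one.
    """
    aligned = align_features(feature_map)
    regularization_loss = cosine_distance(aligned, teacher_embedding)
    return (1.0 - lam) * prediction_loss + lam * regularization_loss
```

At inference time none of this is needed: the teacher embedding and the regularization term are dropped, and only the trained model produces predictions.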