Abstract:
We uncover latent topics embedded in the management discussion and analysis (MD&A) of financial reports from the listed companies in the US, and we examine the evolution of topics found by a dynamic topic modelling method - Dynamic Embedding Topic Model. Using more than 203k reports with 40M sentences ranging from 1997 to 2017, we find 30 interpretable topics. The evolution of topics follows economics cycles and major industrial events. We validate the significance of these latent topics by the state-of-the-art performance of a simple bankruptcy ensemble classifier trained on both novel features - topical distributed representation of the MD&A, and accounting features.