ECTQA: Expanding Earnings Call Transcript Summarization

Abstract

There has been significant progress in automatic text summarization yet financial document summarization is still an emerging field of study, particularly for public company's earnings calls. Techniques to efficiently and effectively summarize company earnings calls in a comprehensive yet consumable manner are still in their infancy. In this paper, we present ECTQA, a novel dataset of the Q&A sections of earnings call transcripts (ECTs) hosted by publicly-traded companies and reference summaries generated by a salient sentence extractive model. These Q&A sections are typically 90% of any given earnings call and are largely under-analyzed by academic research and financial news organizations. We also present methods to fine-tune the ECT-BPS modeling framework that generates both extractive and abstractive paraphrased summaries of the Prepared Remarks of ECTs with the intent to apply it to our novel ECTQA dataset.

Data

The ECTQA dataset can be found in the data folder.

Models

The extractive, abstractive, unsupervised, and long document summarization models dan be found in the models folder.

Code

The notebooks can be found in the code folder.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
code		code
data/ECTQA		data/ECTQA
models		models
ECTQA_w266_summer_2023_Abrahamian_Azinge.pdf		ECTQA_w266_summer_2023_Abrahamian_Azinge.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ECTQA: Expanding Earnings Call Transcript Summarization

Abstract

Data

Models

Code

About

Releases

Packages

Languages

andrewabrahamian/ectqa

Folders and files

Latest commit

History

Repository files navigation

ECTQA: Expanding Earnings Call Transcript Summarization

Abstract

Data

Models

Code

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages