NPR (National Public Radio) interview dialog data
Creators:
Julian McAuley
Publication Date:
2020-xx-xx
Data Category:
Dataset Description:
The NPR Media Dialog Transcripts dataset contains interview transcripts from National Public Radio (NPR) programs spanning approximately 20 years. It includes over 140,000 transcripts covering more than 10,000 hours of audio, for a total of 3.2 million utterances and 126.7 million words. The dataset is organized into individual transcripts, each corresponding to a specific NPR episode. Each transcript provides metadata (episode title, broadcast date, program name, and speaker identities) together with the full text of the conversation between hosts and guests. These features are valuable for analyzing discourse patterns, speaker interactions, and content trends within NPR's programming.
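As a rough illustration of the per-episode structure described above, the following Python sketch models one transcript and counts words per speaker. All class names, field names, and sample values here are hypothetical; the actual file format and column names of the dataset are not given in this record and may differ.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import List


@dataclass
class Utterance:
    speaker: str  # hypothetical field: speaker name or role
    text: str     # hypothetical field: what the speaker said


@dataclass
class Transcript:
    episode_title: str   # hypothetical field names throughout
    broadcast_date: str  # date format is an assumption
    program: str
    utterances: List[Utterance] = field(default_factory=list)


def words_per_speaker(transcript: Transcript) -> Counter:
    """Count whitespace-separated words spoken by each speaker."""
    counts = Counter()
    for utt in transcript.utterances:
        counts[utt.speaker] += len(utt.text.split())
    return counts


# Made-up sample data, not taken from the dataset.
example = Transcript(
    episode_title="Example episode",
    broadcast_date="2000-01-01",
    program="Example program",
    utterances=[
        Utterance("HOST", "Welcome to the program."),
        Utterance("GUEST", "Thanks for having me."),
        Utterance("HOST", "Let's begin."),
    ],
)

print(words_per_speaker(example))
```

Keeping utterances as an ordered list preserves turn order, which matters for the discourse and speaker-interaction analyses mentioned in the description.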
Variables:
Details: