One million comic books panel

Creators:

Iyyer, Mohit; Manjunatha, Varun; Guha, Anupam; Vyas, Yogarshi; Boyd-Graber, Jordan; Daumé III, Hal; Davis, Larry

Publication Date:

2016

Data Category:

Dataset Description:

Visual narrative is often a combination of explicit information and judicious omissions, relying on the viewer to supply missing details. In comics, most movements in time and space are hidden in the "gutters" between panels. To follow the story, readers logically connect panels together by inferring unseen actions through a process called "closure". While computers can now describe what is explicitly depicted in natural images, in this paper we examine whether they can understand the closure-driven narratives conveyed by stylized artwork and dialogue in comic book panels. We construct a dataset, COMICS, that consists of over 1.2 million panels (120 GB) paired with automatic textbox transcriptions. An in-depth analysis of COMICS demonstrates that neither text nor image alone can tell a comic book story, so a computer must understand both modalities to keep up with the plot. We introduce three cloze-style tasks that ask models to predict narrative and character-centric aspects of a panel given n preceding panels as context. Various deep neural architectures underperform human baselines on these tasks, suggesting that COMICS contains fundamental challenges for both vision and language. Overall, the dataset is organized into three components:

Panel Images: Each panel is stored as an image file, capturing the visual content of the comic scenes.
Textbox Transcriptions: Textual content from each panel is extracted using OCR, allowing for analysis of dialogues, narratives, and other textual elements.
Metadata: Additional information such as panel dimensions, position within the page, and associated comic book identifiers is included to facilitate detailed analyses.

Variables:

Details:

Bookmark this Dataset/Publication

One million comic books panel

UK Office for National Statistics (ONS) – Economic and Social Statistics

Replication Data: Shocks and Technology Adoption – Evidence from Electronic Payment Systems

FCA RegData – UK Financial Conduct Authority Regulatory Data

One million comic books panel

Sign In

Register

Reset Password