likes

Showing 1-2 of 2 results

Graph Embedding with Self Clustering: Facebook; GEMSEC

Creators: Rozemberczki, Benedek; Davies, Ryan; Sarkar, Rik; Sutton, Charles
Publication Date: 2019
Creators: Rozemberczki, Benedek; Davies, Ryan; Sarkar, Rik; Sutton, Charles

We collected data about Facebook pages (November 2017). These datasets represent blue verified Facebook page networks across eight distinct categories: Government, News Sites, Athletes, Public Figures, TV Shows, Politicians, Artists, and Companies. In this dataset, nodes represent individual Facebook pages, and edges denote mutual likes between these pages, reflecting the interconnectedness within and between different interest groups.  We reindexed the nodes in order to achieve a certain level of anonymity. The csv files contain the edges — nodes are indexed from 0. We included 8 different distinct types of pages. For each dataset we listed the number of nodes an edges. The dataset’s size varies by category, with the largest subset (Artists) containing 50,515 nodes and 819,306 edges, and the smallest subset (TV Shows) comprising 3,892 nodes and 17,262 edges. In total, the dataset has a size of 0,005 GB and encompasses 134,833 nodes and 1,380,293 edges, offering a rich source for analyzing the structure and dynamics of Facebook page interactions. Structurally, the dataset is divided into eight sub-datasets, each corresponding to a specific category of Facebook pages.

Instagram Influencer Marketing Dataset

Creators: Kim, Seungbae; Jiang, Jyun-Yu; Nakada, Masaki; Han, Jinyoung; Wang, Wie
Publication Date: 2020
Creators: Kim, Seungbae; Jiang, Jyun-Yu; Nakada, Masaki; Han, Jinyoung; Wang, Wie

This dataset contains 33,935 Instagram influencers who are classified into the following nine categories including beauty, family, fashion, fitness, food, interior, pet, travel, and other. The dataset is 262 GB in size, including both metadata in JSON format and images in JPEG format. We collect 300 posts per influencer so that there are 10,180,500 Instagram posts in the dataset. The dataset includes two types of files, post metadata and image files. Post metadata files are in JSON format and contain the following information: caption, usertags, hashtags, timestamp, sponsorship, likes, comments, etc. Image files are in JPEG format and the dataset contains 12,933,406 image files since a post can have more than one image file. If a post has only one image file then the JSON file and the corresponding image files have the same name. However, if a post has more than one image then the JSON file and corresponding image files have different names. Therefore, we also provide a JSON-Image_mapping file that shows a list of image files that corresponds to post metadata.

If you want to use this dataset, please cite it accordingly. The data can be accessed on the respective website link below.

“Multimodal Post Attentive Profiling for Influencer Marketing,” Seungbae Kim, Jyun-Yu Jiang, Masaki Nakada, Jinyoung Han and Wei Wang. In Proceedings of The Web Conference (WWW ’20), ACM, 2020.

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.