SocialGrep/the-2022-trucker-strike-on-reddit · Datasets At Hugging ...

Dataset Card for the-2022-trucker-strike-on-reddit

Dataset Summary

This corpus contains all the comments under the /r/Ottawa convoy megathreads.

Comments are annotated with their score.

Languages

Mainly English.

Dataset Structure

Data Instances

A data point is a Reddit comment.

Data Fields

  • 'type': the type of the data point. Can be 'post' or 'comment'.
  • 'id': the base-36 Reddit ID of the data point. Unique when combined with type.
  • 'subreddit.id': the base-36 Reddit ID of the data point's host subreddit. Unique.
  • 'subreddit.name': the human-readable name of the data point's host subreddit.
  • 'subreddit.nsfw': a boolean marking the data point's host subreddit as NSFW or not.
  • 'created_utc': a UTC timestamp for the data point.
  • 'permalink': a reference link to the data point on Reddit.
  • 'score': score of the data point on Reddit.
  • 'sentiment': the evaluated sentiment of the data point, if any.
  • 'body': the text of the data point.

Dataset Creation

Curation Rationale

[Needs More Information]

Source Data

Initial Data Collection and Normalization

[Needs More Information]

Who are the source language producers?

[Needs More Information]

Annotations

Annotation process

[Needs More Information]

Who are the annotators?

[Needs More Information]

Personal and Sensitive Information

[Needs More Information]

Considerations for Using the Data

Social Impact of Dataset

[Needs More Information]

Discussion of Biases

[Needs More Information]

Other Known Limitations

[Needs More Information]

Additional Information

Dataset Curators

[Needs More Information]

Licensing Information

CC-BY v4.0

Contributions

[Needs More Information]

Tag » Why Are Truckers Protesting Reddit