Global Database of Events, Language, and Tone

From Devec

The Global Database of Events, Language, and Tone

Summary

Glossary

Various acronyms and terms are used in the GDELT documentation. Since there seems to be no unified place for these on the GDELT website, we collect them here.

Term Expansion Meaning Examples
GDELT Global Database of Events, Language, and Tone The project name N/A
GKG Global Knowledge Graph One of the tables in GDELT
CAMEO Conflict and Mediation Event Observations ????
CAMEO code Conflict and Mediation Event Observations code A defined code used in CAMEO Example verbs: Express intent to institute political reform, not specified below (CAMEO 034), Allow humanitarian access (CAMEO 0863). Example actors: IGOUNO (the United Nations), COP (Police officers, officers, criminal investigative units, protective agencies). Example religions: ATH (Atheism/Agnosticism), CHR (Christianity).[1]Template:Rp
GCAM GDELT Global Content Analysis Measures ????
Actor N/A An entity mentioned in an event Barack Obama, Russia, Microsoft, United Church of Christ in Japan
Event N/A
Mention N/A
Codebook N/A Documentation of the table schemas
LIWC Linguistic Inquiry and Word Count
RID ?? mentioned in [1]
GNS mentioned in [2]
GNIS mentioned in [3]
TABARI Text Analysis By Augmented Replacement Instructions[2] N/A
IGO
NGO

Versions

Version Formats available Year of publication Month of publication Years of coverage Dimensions (inputs) Metrics (outputs)
GDELT 1.0 Raw data files, Google BigQuery 1979–present
GDELT 2.0 Raw data files, Google BigQuery
GDELT Visual Knowledge Graph 1.0 Raw data files, Google BigQuery 2016 February[3] Images
GDELT 3.0 2017?[4]

Data description

GDELT 2.0 contains three tables: Events, Mentions, and the Global Knowledge Graph (GKG). In both Mentions and the GKG, each row in the table is an article about an event; the difference seems to be that the GKG contains more columns. On the other hand, in Events each row is an event, and only the first article that mentions the event is stored.

Data dimensions and metrics

GKG 2.0 fields from [5]

GDELT 2.0 Events and Mentions from [6]

Table name General field type Field names giving information about the general field type
GKG 2.0 Date DATE, Dates
GKG 2.0 Location Locations, V2Locations
GKG 2.0 Entities Persons, V2Persons, Organizations, V2Organizations
GKG 2.0 Topic Themes, V2Themes
GKG 2.0 Sentiment V2Tone
GKG 2.0 Source text Quotations
GDELT 2.0 Mentions Date EventTimeDate, MentionTimeDate
GDELT 2.0 Mentions Source text MentionSourceName, MentionIdentifier, MentionDocLen
GDELT 2.0 Mentions Entities Actor1CharOffset, Actor2CharOffset
GDELT 2.0 Mentions Sentiment MentionDocTone
GDELT 2.0 Events Date Day, MonthYear, Year, FractionDate
GDELT 2.0 Events Entities Actor1Code, Actor1Name, Actor1CountryCode, Actor1KnownGroupCode, Actor1EthnicCode, Actor1Religion1Code, Actor1Religion2Code, Actor1Type1Code, Actor1Type2Code, Actor1Type3Code (repeated for Actor2)
GDELT 2.0 Events Sentiment AvgTone
GDELT 2.0 Events Source text NumArticles, NumSources, NumMentions, DATEADDED, SOURCEURL
GDELT 2.0 Events Location Actor1Geo_Type, Actor1Geo_Fullname, Actor1Geo_CountryCode, Actor1GeoADM1Code, Actor1Geo_ADM2Code, Actor1Geo_Lat, Actor1Geo_Long, Actor1Geo_FeatureID (repeated for Actor2 and Action)

Data sources

GDELT finds news articles through some process, but it's not clear what that process is.

Auxiliary

In addition to the actual news articles, some auxiliary data sources are used for sentiment analysis and location identification.[7]

Methods of estimation

Reception

Usage in debates

See also

External links

References

  1. Script error: No such module "citation/CS1".
  2. Script error: No such module "citation/CS1".
  3. Script error: No such module "citation/CS1".
  4. Script error: No such module "citation/CS1".
  5. Script error: No such module "citation/CS1".
  6. Script error: No such module "citation/CS1".
  7. Script error: No such module "citation/CS1".

Script error: No such module "Check for unknown parameters".