Reuters for Machine Learning

Smart news data for smarter training

Upgrade Your Artificial Intelligence

Acquiring data for AI is an all-important decision. Reuters makes this straightforward with over 30 years of intelligently categorized news data, providing unparalleled depth to fuel your project.

Essential Training Data

The quality of artificial intelligence relies upon robust, versatile training data.

For over 160 years, Reuters has built a legacy as a trusted source of accurate, bias-free information, and the quality of our news data reflects this.

With sophisticated metadata found in over 45 million pieces of fully-licensable news content, Reuters material is an indispensable source of training data, whatever your project is.

Broad Functionality

With the demand for training material comes the need for extensible, future-proof data. Leveraging our newswires, images and video services, Reuters enhances your artificial intelligence by tailoring to key training functions.

Why is Reuters News Data Different?

Our news data is professionally produced and fully-licensed, allowing you to reach insights with greater speed and effectiveness:

  • Rights: Reuters has the proprietary rights to our data corpus and visual assets
  • Trust & Accuracy: Over 2000 media companies rely on Reuters news to make editorial and business decisions every day. Guided by Reuters Trust principles, our news preserves integrity, independence and freedom from bias
  • Diversity: Broad coverage of major topics from over 200 global locations and 16 languages, including business, finance, politics, sports, entertainment, technology, and much more
  • Metadata: Our advanced metadata contains regional and category-specific codes, allowing for intelligent grouping

New Data Use Cases

Machine Translation
30 million articles, 16 languages. Our extensive body of newswires is readily translated into multiple languages with parallel sentencing.

Visual Analysis Training
Advanced image metadata promotes the detection of action and events within our collection of over 13 million images.

Speech to Text Training
Closed-captions and video transcripts in over 800K video assets facilitate advanced training to identify figures, scenes and soundbites.

Knowledge Graphs
Entity-rich data gives contextual value and promotes the building of sophisticated knowledge graphs.

REUTERS/Kacper Pempel
REUTERS/Bobby Yip

Unmatched Diversity of Content

As the world’s largest news agency, Reuters continuously produces substantial
multimedia content, enabling you to thoroughly test and build your AI.

Our large body of trusted news data continues to grow on a daily basis:

  • 200 transcripted videos added per day
  • Over 1,500 images with intelligent metadata added per day
  • 2.2 million translated text articles added every year

Train Smarter

Find out more about how Reuters licensed data can meet your training requirements.