House of Commons Hansard – Big Data

Hansard – 1803-2005, 7.6 million speeches, 1.6 billion words. The Hansard corpus (a collection of texts) contains nearly every speech given in the British Parliament from 1803 to 2005, and it lets you search these speeches, including with semantically based searches, in ways that are not possible with any other resource.

The SAMUELS project (Semantic Annotation and Mark-Up for Enhancing Lexical Searches) makes it possible to search the 7,545,101 texts, given by nearly 40,000 individual speakers.

The corpus does not provide full copies of the texts. You would need to obtain these from the Hansard archive.
