Hansard Debates through a Telescope: Two centuries of digital Parliamentary records – free event in Edinburgh 13th Nov 2019

Prof. Marc Alexander discusses how semantic queries are enabled for the Hansard Corpus, a digital record of 7.6 million UK Parliament speeches.


The Hansard Corpus (1803-2003) contains 7.6 million speeches from the UK Parliament, though the record is not verbatim. Its size – 1.6 billion words – makes it particularly unwieldy to explore digitally. As a result, in 2015 it was tagged semantically using the tagset of the Historical Thesaurus of English to enable semantic queries and aggregation. In this talk, I will discuss what the corpus represents, the overall picture of the Parliamentary record from a semantic point of view (‘through a telescope’), and what such digital parliamentary records can tell us.

More information and booking via Eventbrite
