Machine Reading

An NLP research group at the UCL Computer Science department teaching machines how to read.

The amount of published information is growing rapidly. Much of this information comes in the form of unstructured text which cannot easily be searched, mined, visualized or, ultimately, acted upon. The principal goal of our group is to build machines that can read and "understand" this textual information, converting it into interpretable structured knowledge to be leveraged by humans and other machines alike.

To achieve our goal we work in the intersection of Natural Language Processing and Machine Learning. We rely heavily on statistical methods of various flavours.

Our group is part of the UCL Computer Science department, affiliated with CSML and based in the London Media Technology Campus. We are organizing the South England Natural Language Processing Meetup. Get in touch if you're interested in attending.

If you are interested in doing a PhD with us, please have a look at these instructions.

Interpretation of Natural Language Rules in Conversational Machine Reading accepted at EMNLP 2018 - CodaLab challenge are now LIVE at sharc-data.github.io!
Wronging a Right: Generating Better Errors to Improve Grammatical Error Detection accepted at EMNLP 2018 - code available at this link!
Cape achieves new SoTA on the TriviaQA Wiki dataset codalab Leaderboard. More details can be found here. Cape is super easy to use, extend and integrate into all kinds of software!
Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge accepted at CoNLL 2018 - code available at this link!
We are proud of our 2nd place in the EMNLP 2018 FEVER shared task (fever.ai) thanks to the amazing work of @takuma_ynd!
We will be giving a tutorial on Machine Reading at UAI 2018!
Numeracy for Language Models: Evaluating and Improving their Ability to Predict Numbers accepted at ACL!
Jack the Reader – A Machine Reading Framework accepted at ACL, System Demonstrations track!
Behavior Analysis of NLI Models: Uncovering the Influence of Three Factors on Robustness accepted at HLT-NAACL!
Convolutional 2D Knowledge Graph Embeddings accepted at AAAI!
The 6th Workshop on Automated Knowledge Base Construction (AKBC 2017) returns to NIPS: submit your papers by October 21st!
End-to-end Differentiable Proving accepted at NIPS!
Adversarial Sets for Regularising Neural Link Predictors accepted at UAI!
Programming with a Differentiable Forth Interpreter accepted at ICML!
A Supervised Approach to Extractive Summarisation of Scientific Papers, a paper based on Ed Collins' MEng thesis, accepted at CoNLL!
SemEval 2017 Science task description paper preprint now available online
Tim Rocktäschel is awarded a Google Ph.D. Fellowship in Natural Language Processing
Neural Architectures for Fine-grained Entity Type Classification wins outstanding paper award at EACL!
Multi-Task Learning of Keyphrase Boundary Classification accepted at ACL!
We co-organised a Poetry AI workshop at UCL, at which humans acted as a neural network to generate poetry. Slides of the event are available here.
Wired article discussing our project on machine reading of scientific publications and how it could aid peer review
Frustratingly Short Attention Spans in Neural Language Modeling, a paper based on Michal Daniluk's MSc Machine Learning project, accepted to ICLR! Michal also received the MSc Machine Learning Programme Director’s Award (2015/2016) for Outstanding Project Report (Second Place)
2 papers by our group accepted at EACL!
SemEval 2017 Science results are announced. Congratulations to the winning teams, s2_end2end, MayoNLP and MIT!
We are co-organising the first workshop for women and underrepresented minorities in NLP (WiNLP) at ACL 2017. Consider participating!
emoji2vec: Learning Emoji Representations from their Description, a paper based on Ben Eisner's UCLMR internship, won the best paper award at SocialNLP 2016!
We are co-organizing a workshop on Neural Abstract Machines & Program Induction (NAMPI) at NIPS 2016. Consider submitting a paper!
We are co-organizing the SemEval 2017 Task 10: Extracting Keyphrases and Relations from Scientific Publications (ScienceIE). Consider participating!
Defining Words with Words: Beyond the Distributional Hypothesis was awarded the best proposal award at RepEval!
4 papers by our group accepted at EMNLP!

Tweets by @uclmr

Sebastian Riedel Reader

Sebastian works in NLP and Machine Learning. He is particularly interested in helping machines to read more accurately by leveraging knowledge gathered through reading more accurately.
Matko Bošnjak Final year PhD Student

Matko's interests include both natural and unnatural language processing, and their interplay. Specifically, he's enjoying differentiable abstract machines and interpreters, code induction, and trainable combinations of neural networks and code. When tired from unnatural language, he can be found enjoying a good question answering model.
Ingolf Becker 3rd year PhD Student

Ingolf researches into the intersection of NLP and Information Security. His work combines topic models, sentiment analysis and statistical tests to transcripts on security topics, attempting to automatically infer conflicts between security and business processes.
Pontus Stenetorp Senior Research Associate

Pontus works somewhere in the intersection between Natural Language Processing and Machine Learning. He is particularly interested in representation learning and is currently funded by a machine reading grant from the Allen Foundation.
Johannes Welbl 3rd year PhD Student

I'm interested in Machine Learning and NLP, in particular Reading Comprehension and Knowledge Base Inference. Currently I work on multi-step Reading Comprehension, a scenario in which a model combines multiple facts to arrive at an answer.
Jeff Mitchell Research Associate

I'm interested in using machine reading technology to extract and verify facts from raw text.
Pasquale Minervini Research Associate

Pasquale is interested in Machine Reading, and how to leverage background knowledge in representation learning algorithms. He is currently funded by a machine reading grant from the Allen Foundation.
Tom Crossland PhD Student

Tom is an astrophysicist working with the MR group and the Mullard Space Science Laboratory, interested in Machine Learning applications to his original subject area. He is currently working on automatic measurement extraction from scientific literature, with a view to applying the results to galactic archaeology.
Ed Grefenstette Honorary Reader

Ed is interesting in teaching machines to understand and communicate using language (formal and natural), and in both neural and symbolic reasoning (and the intersection thereof). He is involved with UCLMR's research activities alongside a full time role in industry.
Patrick Lewis First Year PhD Student

Patrick is a first year PhD student, interested in Transfer Learning, Machine Reading and leveraging world knowledge to improve predictions in NLP systems.
Yuxiang Wu First Year PhD Student

Yuxiang is a first year PhD student, interested in combining connectionism and symbolism to improve the reasoning capability of NLP systems.
Saku Sugawara Visiting PhD Student

Saku is a Ph.D. student at the University of Tokyo, interested in natural language understanding by machines.

Andreas Vlachos Research Associate

Now a senior lecturer @ University of Cambridge.
Luke Hewitt Intern

Now a PhD student @ MIT.
Gerasimos Lampouras Research Associate

Now a research associate @ University of Sheffield.
Sonse Shimaoka Intern

Now a master student @ Tohoku University.
Guillaume Bouchard Senior Research Associate

Now a Research Manager @ Facebook
Thomas Demeester Visiting Researcher

Now a post-doc @ University of Ghent
Jason Naradowsky Research Associate
Now a research scientist @ Preferred Networks (PFN)
Théo Trouillon Visiting PhD Student

Now back to being a PhD student @ Xerox Research Centre Europe
Marzieh Saeidi PhD Student

Now a Research Scientist @ Facebook.
Isabelle Augenstein Research Associate

Now an assistant professor @ University of Copenhagen.
Tim Rocktäschel PhD Student

Now a Research Scientist @ Facebook & Lecturer @ UCL.
Naoya Inoue Visiting Researcher

Now an assistant professor @ Tohoku University.
Tim Dettmers Intern

Now a PhD student @ University of Washington.
V. Ivan Sanchez PhD Student

Now an NLP researcher @ Lenovo
Andres Campero Visiting PhD Student

Now back to being a PhD student @ MIT.
Takuma Yoneda Intern

Now a student @ Toyota Technological Institute at Chicago
Georgios Spithourakis PhD Student

Now a ML engineer @ PolyAI

stat-nlp-book is an interactive Statistical NLP book in Python, used for our StatNLP from 2016 on
stat-nlp-book-scala is an interactive Statistical NLP book in Scala, used for our StatNLP course in 2015/16
Jack the Reader is a Machine Reading framework for Question Answering, Natural Language Inference, and Link Prediction - see the paper here.
Neural Theorem Prover is a end-to-end differentiable logic reasoner, implementing the model described in End-to-end Differentiable Proving.
Inferbeddings is a link prediction framework that allows including First-Order background knowledge via adversarial training - the model is described in Adversarial Sets for Regularising Neural Link Predictors.
wolfe is a framework for building rich machine learning models, based on functional programming, factor graphs, optimization and composition.
ucleed is a biomedical event extractor that ranked first in several tracks of the BioNLP 2011 shared task.
thebeast is a Markov Logic inference and learning engine.
What's Wrong With My NLP? is a visualizer for NLP problems.

Machine Reading

Introduction

News

People

Sebastian Riedel Reader

Matko Bošnjak Final year PhD Student

Ingolf Becker 3rd year PhD Student

Pontus Stenetorp Senior Research Associate

Johannes Welbl 3rd year PhD Student

Jeff Mitchell Research Associate

Pasquale Minervini Research Associate

Tom Crossland PhD Student

Ed Grefenstette Honorary Reader

Patrick Lewis First Year PhD Student

Yuxiang Wu First Year PhD Student

Saku Sugawara Visiting PhD Student

Alumni

Andreas Vlachos Research Associate

Luke Hewitt Intern

Gerasimos Lampouras Research Associate

Sonse Shimaoka Intern

Guillaume Bouchard Senior Research Associate

Thomas Demeester Visiting Researcher

Jason Naradowsky Research Associate

Théo Trouillon Visiting PhD Student

Marzieh Saeidi PhD Student

Isabelle Augenstein Research Associate

Tim Rocktäschel PhD Student

Naoya Inoue Visiting Researcher

Tim Dettmers Intern

V. Ivan Sanchez PhD Student

Andres Campero Visiting PhD Student

Takuma Yoneda Intern

Georgios Spithourakis PhD Student

Software