New Open Source Tool – TM Cleaner

We are happy to announce Translation Memory (TM) Cleaner, a tool for identifying translation units in translation memories or parallel corpora that contain segments that are not translations of each other. TM Cleaner has the following features: 1. It is written in Python and Java. 2. It is hosted on GitHub:

Continue Reading

How do I create a customised machine translation engine?

The press releases and general hype around machine translation issued by IT giants like Google, Skype/Microsoft and IBM have raised expectations of machine translation (MT) in recent years. Such PR work has led even the BBC to question whether human translators are facing the end of the line. This is only opposed by a

Continue Reading

Intelligent assistant landscape shows slow growth but huge potential

Ever since Apple’s Siri heralded the age of intelligent assistants (IAs) four years ago, followed by Microsoft, Google, and Facebook, pundits have complained that intelligent assistant technology isn’t living up to its promise. The truth is that innovation in this domain, as in all technological domains, follows a predictable cycle and goes

Continue Reading

Free summer schools in Corpus linguistics and other digital methods: Lancaster University, 12-15 July 2016

Lancaster University is pleased to offer six free training events that cover the techniques of corpus linguistics, computational analysis of language and geographical information systems. The following six Summer Schools will run in July 2016:
o Corpus linguistics for Language studies
o Corpus linguistics for Social Science
o Corpus linguistics for the Humanities
o Statistics

Continue Reading

Summer School in English Corpus Linguistics

The Survey of English Usage at University College London (UCL) will be running the fourth three-day Summer School in English Corpus Linguistics on 6-8 July 2016. The Summer School in English Corpus Linguistics is an introduction to Corpus Linguistics for students of language and linguistics and teachers of English. Participants should have a basic knowledge

Continue Reading

The 38th European Conference on Information Retrieval: Student Travel Grant

The 38th European Conference on Information Retrieval
20-23 March 2016, Padua, Italy
Student Travel Grant Deadline: 19 February 2016
http://ecir2016.dei.unipd.it/
http://twitter.com/ecir2016

The ECIR 2016 conference is pleased to announce the availability of 15 student travel grants made available by ELIAS (http://elias-network.eu/).

Grant Scope
The ECIR 2016 student travel grant program provides up to

Continue Reading

Automating the Data Scientist

Talk about a fraught concept: this one ought to give you the willies. I don’t mean to be a Luddite about the magical abilities of technology, but the concept here is to replace data scientists with software. For as long as I have practiced data science, I have constantly come upon new and unexpected reasons

Continue Reading