Dr. Terry Lima Ruas

Senior Researcher for Natural Language Processing

write email


Terry Ruas has completed his Ph.D. in Computer and Information Science at the University of Michigan – Dearborn in the USA, where he also worked as a Graduate Student Instructor for graduate and undergraduate courses. He completed his MSc and BSc in Information Engineering and Computer Science, respectively, at the Federal University of ABC (UFABC) in Brazil. During his graduate studies, he also worked in the industry for IBM for five years. He was also a Ph.D. intern at the National Institute of Informatics in Tokyo, Japan, and a summer teaching assistant at VelTech in Chennai, India.


Terry’s research interest is the result of an overlap between natural language processing, data science, and machine learning applied to text mining to extract semantic features. He explores how semantic features can solve different NLP downstream tasks and real-world problems. Jan Philip Wahle and Frederic Kirstein are closely involved with Terry’s projects.

Terry’s primary research interest topics are:

  • Natural language processing
  • Paraphrase generation and detection
  • Plagiarism detection
  • Scientific paper mining
  • Scientometrics
  • Text and meeting summarization
  • Machine learning
  • Data Science


Dr. Terry Lima Ruas

Papendiek 14
37073 Göttingen

Office: SUB Historic Building 0.208
Mail: ruas@gipplab.org

Social Network Profiles


07/2022 – Present

Senior Researcher for Natural Language Processing
Scientific Information Analytics Group, University of Göttingen, Germany

09/2019 – 06/2022

Senior Researcher
Data & Knowledge Engineering Group, University of Wuppertal, Germany

09/2015 – 08/2019

Computer and Information Science, Ph.D.
University of Michigan-Dearborn, Michigan, USA

01/2018 – 07/2018

Visiting Researcher
National Institute of Informatics (NII), Tokyo, Japan

12/2009 – 07/2015

Product Manager; IT Specialist
IBM, São Paulo, Brazil

02/2012 – 10/2013

Information Engineering, M.Sc.
Federal University of ABC (UFABC), São Paulo, Brazil

01/2007 – 12/2010

Computer Science and Science & Technology, B.Sc.
Federal University of ABC (UFABC), São Paulo, Brazil



Deep Learning for Natural Language Processing


Programming for Data Scientists: Python

Selected Topics in Data Science


Key Concepts in Computer Science

Applied Natural Language Processing and Text Mining


Key Concepts in Computer Science


Key Concepts in Computer Science

Applied Natural Language Processing and Text Mining


Key Concepts in Computer Science


Applied Natural Language Processing and Text Mining

Students projects

The slides here are examples of student research projects that I’m currently offering. Please let me know if you are interested in these projects or if you have other project ideas.


A complete list of my publications is available here

How Large Language Models are Transforming Machine-Paraphrase Plagiarism
J. P. Wahle, T. Ruas, F. Kirstein, and B. Gipp
In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 952–963, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics, December 07-11, 2022.
(PDF  DOI  BibTeX)

D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research

J. P. Wahle, T. Ruas, Saif M. Mohammad. Meuschke, B. Gipp
Proceedings of The 13th Language Resources and Evaluation Conference, LREC 2022, Marseille, France, June 20-25, 2022
(PDF  DOI  BibTeX)

Identifying Machine-Paraphrased Plagiarism
J. P. Wahle, T. Ruas, T. Foltynek, N. Meuschke, B. Gipp
Information for a Better World: Shaping the Global Future – 17th International Conference, iConference 2022.
(PDF  DOI  BibTeX)

Neural Media Bias Detection Using Distant Supervision With BABE – Bias Annotations By Expert
Spinde, T.; Plank, M.; Krieger, J.; Ruas, T.; Gipp, B.; and Aizawa, A. In Findings of the Association for Computational Linguistics: EMNLP 2021, pages 1166–1177, Punta Cana, Dominican Republic, 2021.
(PDF  DOI  BibTex)

Document Similarity for Research Papers
Ostendorff, M.; Ruas, T.; Blume, T.; Gipp, B.; and Rehm, G. In Proceedings of the 28th International Conference on Computational Linguistics (COLING), pages 6194–6206, Barcelona, Spain (Online), 2020.
(PDF DOI BibTex)

Multi-sense embeddings through a word sense disambiguation process.
Ruas, T.; Gorsky, W.; and Aizawa, A. Expert Systems with Applications, 136: 288 – 303. 2019.
( PDF DOI BibTex)