About

I am an Assistant Professor at Tehran Institute for Advanced Studies (TeIAS) and an Affiliated Lecturer at the University of Cambridge.

My primary research interest lies in the area of Natural Language Processing (NLP) where I have worked on different problems in lexical semantics, such as semantic representation and similarity, sense representation, word sense disambiguation, and ontology construction and alignment.

[google scholar] [twitter]

Background

Iran University of Science and Technology

I was an Assistant Professor at the Computer Engineering department of IUST (2018-2020). During this time, I taught multiple graduate and undergraduate courses (AI, Deep Learning, NLP, and Advanced Programming) and supervised several graduate students.


University of Cambridge

I was a Research Associate at the Language Technology Lab of the University of Cambridge (2015-2018). During this, I worked on PheneBank (PI: Dr. Nigel Collier), a project at the intersection of Natural Language Processing and Biomedical sciences, funded by the UK's Medical Research Council.

Sapienza University

I did my PhD at the Linguistic Computing Laboratory of the Sapienza University of Rome, Italy. My PhD advisor was Dr. Roberto Navigli with whom I did research in lexical semantics with a specific focus on the unified semantic representation of different linguistic items.

News

    Our book on Embeddings in NLP is finally out! Check here for prints and ebooks (more info). An early draft of the book can be obtained from here.

  • Contact me if you are interested in attending undergraduate or graduate NLP study groups.
  • I will be the Program Co-Chair of *SEM-2022, with Ellie Pavlick (Brown University).
  • Oct 2021: Our work on the PheneBank project was accepted for publication at Bioinformatics.
  • Sep 2021: The Word-in-Context dataset has a dedicated subsection in the 3rd edition of Jurafsky&Martin Speech and Language Processing textbook!
  • Sep 2021: EMNLP-2021 papers: Token-level probing of BERT (main), A critical investigation of dataset bias mitigation techniques (Findings), Isotropy of fine-tuned spaces (Findings), A cross-model comparison of BERToid models (BlackboxNLP).
  • Aug 2021: To give a talk on "Prompting in NLP" in ASOC 2021.
  • Aug 2021: Gave a joint talk, with Amir Hesam Salavati (CTO of Achareh and Ubaar), on Data Science.
  • Aug 2021: Teaching a 5-day course at ESSLLI 2021 on Embeddings in NLP, with Jose.
  • May 2021: My student's work on isotropy enhancement was accepted as a short paper at ACL 2021.
  • May 2021: ParsFEVER (a new dataset for Farsi fact verification) was accepted to be presented *SEM 2021.
  • Apr 2021: Check out my students' work on token-level analysis of BERT.
  • Mar 2021: Our comprehensive analysis article (on language models and lexical ambiguity) to appear on Computational Linguistics!
  • Feb 2021: Teaching NLP (for the first time!) this semester.
  • Feb 2021: New collaborations with the Economics Faculty (UoC), to appear in WASSA-2021 and Hackashop-2021.
  • Jan 2021: WiC-TSV (target sense verificatoin) was accepted at EACL-2021. The benchmark provides a novel way of evaluating lexical ambiguity for constrained domains.
  • Nov 2020: Teaching Deep Learning (graduate course) at TeIAS, co-teaching AI course at IUST with Sauleh Eetemadi.
  • Sep 2020: Check out our comprehensive overview and analysis of Language Models and Word Sense Disambiguation.
  • Sep 2020: XL-WiC to appear at EMNLP 2020.
  • Aug 2020: to give a talk on "Embeddings in NLP" at IPM ASOC 2020, Aug 22-26.
  • July 2020: To serve as a member of Steering Committee for SemEval-2021, Senior AC for ACL-2021, Senior PC for AAAI-2021 and IJCAI-2021, and AC for NAACL-2021 and *SEM2020.
  • June 2020: Results are out for SemEval-2020 Task 3, a task for fine-grained measurement of contextual word similarity.
  • May 2020: Happy to be moving to TeIAS, a new research institute in Tehran, that would allow me to stay more focused on my research. Very grateful to colleagues and friends at IUST for being so helpful and welcoming in the past two years.

Publications

D. Loureiro, K. Rezaee, M. T. Pilehvar, J. Camacho-Collados
Language Models for Word Sense Disambiguation.
Computational Linguistics 2021.

A. Breit, A. Revenko, K. Rezaee, M. T. Pilehvar, J. Camacho-Collados
WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words in Context
EACL 2021.

C. Conforti, J. Berndt, M. T. Pilehvar, C. Giannitsarou, F. Toxvaerd, and N. Collier
STANDER: An Expert-Annotated Dataset for News Stance Detection and Evidence Retrieval.
Findings of EMNLP 2020.

A. Raganato, T. Pasini, J. Camacho-Collados, and M.T. Pilehvar
XL-WiC: A Multilingual Benchmark for Evaluating Semantic Contextualization.
EMNLP 2020.

C. Conforti, J. Berndt, M. T. Pilehvar, C. Giannitsarou, F. Toxvaerd, and N. Collier
Will-They-Won't-They: A Very Large Dataset for Stance Detection on Twitter.
ACL 2020.

M Gritta, MT Pilehvar, and N Collier
A pragmatic guide to geoparsing evaluation.
Language Resources and Evaluation, 2020.

V. Prokhorov, M. T. Pilehvar, D. Kartsaklis, P. Lio, and N. Collier
Unseen Word Representation by Aligning Heterogeneous Lexical Semantic Spaces.
AAAI 2019, Hawaii, USA.

M. T. Pilehvar and J. Camacho-Collados
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations.
NAACL 2019, Minneapolis, USA.


M. T. Pilehvar
On the Importance of Distinguishing Word Meaning Representations: A Case Study on Reverse Dictionary Mapping.
NAACL 2019, Minneapolis, USA.

V. Prokhorov, M. T. Pilehvar, and N. Collier
Generating Knowledge Graph Paths from Textual Definitions using Sequence-to-Sequence Models.
NAACL 2019, Minneapolis, USA.

Teachings

Artificial Intelligence (undergraduate)

I have taught this course several times at IUST: [spring 97][fall 97][fall 98][spring 99][fall 99]


Deep Learning (graduate)

My deep learning courses are mostly applied, with special focus on working with deep learning frameworks, such as Keras, Tensorflow, and Pytorch. [fall 97][fall 98][fall 99]


Deep Learning (undergraduate)

I usually dedicate the last few weeks of the undergraduate AI course to teaching basics of deep learning. However, due to high interest, I once tried having a full undergraduate course on deep learning in [spring 98].


Natural Language Processing (graduate)

I'm currently teaching a graduate course on the topic (for the first time). Please feel free to join!


Advanced Programming (undergraduate)

My Java programming course in [fall 97].

Other

I enjoy stargazing laying flat on my back outside under the clear skies of my hometown (Hamedan). Many years ago, when I was younger, I used to do some astrophotography. I used to send my photos to Spaceweather.com. I think this is my last photo published on Spaceweather. I once had a photo published in Astronomy Magazine (the largest U.S. magazine on the subject).



Contact

  • mp792@cam.ac.uk

Department of Theoretical and Applied Linguistics
University of Cambridge
Sidgwick Avenue
Cambridge
CB3 9DA
United Kingdom