Diego Campos Moussallem

Diego Moussallem is a senior researcher at the Data Science group, Paderborn University. He is native from Brazil (at Rio de Janeiro) and has been living in Germany since 2015 when he was awarded a Ph.D. scholarship from National Council for Scientific and Technological Development (CNPq). Diego attained his bachelor in Information Systems at UVA (Veiga de Almeida University) in 2011, Brazil. Afterward, Diego obtained a Master of Science in Machine Translation at IME (Military Institute of Engineering) in 2014, Brazil. Nowadays, Diego is in his last Ph.D. year at Paderborn University (transfer from Leipzig University) and leads the NLP unit at DICE which is mainly focused on applying knowledge graphs and linked data concepts into NLP tasks such as machine translation, natural language generation, knowledge extraction (named entity recognition, entity linking and relation extraction), and question answering. Additionally, Diego also (co-)authored more than 20 peer-reviewed publications. Apart from his scientific academic knowledge, he has experience in the development of websites and software for business and open source projects using languages such as Python, Java, PHP, Html5, CSS, C++, SQL, and SPARQL. Moreover, Diego worked on several EU projects such as DIESEL, QAMEL, HOBBIT, GEISER, LIMBO, OPAL and SOLIDE.

Informações coletadas do Lattes em 07/08/2025

Acadêmico

Formação acadêmica

Doutorado em Data Science

2015 - 2019

Universität Paderborn, UNI/Paderbon
Título: Knowledge Graphs for Multilingual Language Translation and Generation
Orientador: Prof. Dr. Axel-Cyrille Ngonga Ngomo
Bolsista do(a): Conselho Nacional de Desenvolvimento Científico e Tecnológico, CNPq, Brasil. Palavras-chave: Semantic Web; Machine Translation; Natural Language Processing; Machine Learning; Big Data.Grande área: Ciências Exatas e da Terra

Mestrado em Sistemas e Computação

2012 - 2014

Instituto Militar de Engenharia
Título: Utilização de Ontologias para Desambiguação de Termos Homônimos em Traduções Simultâneas para Comunicadores de Tempo Real
, Ano de Obtenção: 2014.Ricardo Choren Noya.Bolsista do(a): Coordenação de Aperfeiçoamento de Pessoal de Nível Superior, CAPES, Brasil. Palavras-chave: web semâtica; tradução simultânea; processamento de linguagem natural; ontologia; sistemas de diálogo.Grande área: Ciências Exatas e da Terra

Graduação em Sistemas de Informação

2007 - 2011

Universidade Veiga de Almeida
Título: Comunicador Semântico com Tradução Simultânea
Orientador: Matheus Bousquet Bandini

Ensino Médio (2º grau)

2004 - 2006

Escola Domingos Savio

Ensino Fundamental (1º grau)

2002 - 2003

Escola Domingos Savio

Ensino Fundamental (1º grau)

2000 - 2001

Instituto São Francisco de Sales

Formação complementar

2013 - 2014

Formação JAVA. (Carga horária: 100h). , Caelum Ensino e Inovação, CAELUM, Brasil.

2012 - 2012

Windows Server 2003. (Carga horária: 64h). , NPI Brasil, NPI, Brasil.

2011 - 2011

Linguagem de Programação PHP. (Carga horária: 32h). , Universidade Veiga de Almeida, UVA/RJ, Brasil.

2010 - 2010

Web Designer e Designer Gráfico. (Carga horária: 64h). , S.O.S Computadores, S.O.S, Brasil.

Idiomas

Bandeira representando o idioma Inglês

Compreende Bem, Fala Bem, Lê Bem, Escreve Bem.

Bandeira representando o idioma Espanhol

Compreende Razoavelmente, Fala Pouco, Lê Razoavelmente, Escreve Pouco.

Bandeira representando o idioma Português

Compreende Bem, Fala Bem, Lê Bem, Escreve Bem.

Bandeira representando o idioma Alemão

Compreende Razoavelmente, Fala Razoavelmente, Lê Pouco, Escreve Pouco.

Áreas de atuação

Grande área: Ciências Exatas e da Terra / Área: Ciência da Computação.

Participação em eventos

the Knowledge Capture Conference. MAG: A Multilingual, Knowledge-base Agnostic and Deterministic Entity Linking Approach. 2017. (Congresso).

WI '17 - IEEE/WIC/ACM International Conference on Web Intelligence. GENESIS ? A Generic RDF Data Access Interface. 2017. (Congresso).

WI '17 - IEEE/WIC/ACM International Conference on Web Intelligence. LOG4MEX: A Library to Export Machine Learning Experiments. 2017. (Congresso).

ACL. 2016. (Congresso).

Semantics' 16. LIDIOMS - A Multilingual Linked Idioms Data Set. 2016. (Congresso).

Produções bibliográficas

  • MOUSSALLEM, DIEGO ; WAUER, MATTHIAS ; NGOMO, AXEL-CYRILLE NGONGA . Machine Translation using Semantic Web Technologies: A Survey. Journal of Web Semantics , v. 51, p. 1-19, 2018.

  • MOUSSALLEM, DIEGO ; CHOREN, RICARDO . Using Ontology-Based Context in the Portuguese-English Translation of Homographs in Textual Dialogues. International Journal of Artificial Intelligence & Applications (IJAIA) , v. 6, p. 17-33, 2015.

  • MOUSSALLEM, DIEGO ; USBECK, RICARDO ; Röder, Michael ; Ngonga Ngomo, Axel-Cyrille . Entity Linking in 40 Languages Using MAG. Lecture Notes in Computer Science. 1ed.: Springer International Publishing, 2018, v. , p. 176-181.

  • FERREIRA, T. C. ; MOUSSALLEM, DIEGO ; KADAR, A. ; WUBBEN, S. ; KRAHMER, E. . NeuralREG: An end-to-end approach to referring expression generation. In: the 56th Annual Meeting of the Association for Computational Linguistics, 2018, Melbourne. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018. p. 1959-1969.

  • FERREIRA, T. C. ; MOUSSALLEM, DIEGO ; KRAHMER, E. ; WUBBEN, S. . Enriching the WebNLG corpus. In: the 11th International Conference on Natural Language Generation, 2018, Tilburg. Proceedings of the 11th International Conference on Natural Language Generation, 2018. p. 171-176.

  • NGOMO, AXEL-CYRILLE NGONGA ; RÖEDER, MICHAEL ; MOUSSALLEM, DIEGO ; USBECK, RICARDO ; SPECK, R. . BENGAL: An Automatic Benchmark Generator for Entity Recognition and Linking. In: the 11th International Conference on Natural Language Generation, 2018, Tilburg. Proceedings of the 11th International Conference on Natural Language Generation, 2018. p. 339-349.

  • MOUSSALLEM, DIEGO ; FERREIRA, T. C. ; ZAMPIERI, M. ; CAVALCANTI, M. C. ; XEXEO, G. ; NEVES, M. ; NGOMO, AXEL-CYRILLE NGONGA . RDF2PT: Generating Brazilian Portuguese Texts from RDF Data. In: LREC - the Eleventh International Conference on Language Resources and Evaluation, 2018, Myazaki. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). France: European Language Resources Association (ELRA), 2018.

  • MOUSSALLEM, DIEGO ; MA Sherif ; ESTEVES, DIEGO ; ZAMPIERI, M. ; NGOMO, AXEL-CYRILLE NGONGA . LIdioms: A Multilingual Linked Idioms Data Set. In: LREC - the Eleventh International Conference on Language Resources and Evaluation, 2018, Myazaki. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). France: European Language Resources Association (ELRA), 2018.

  • MOUSSALLEM, DIEGO ; USBECK, RICARDO ; RÖEDER, MICHAEL ; NGOMO, AXEL-CYRILLE NGONGA . MAG. In: the Knowledge Capture Conference, 2017, Austin. Proceedings of the Knowledge Capture Conference on - K-CAP 2017. New York: ACM Press, 2017. p. 1.

  • ERMILOV, TIMOFEY ; MOUSSALLEM, DIEGO ; USBECK, RICARDO ; NGOMO, AXEL-CYRILLE NGONGA . GENESIS. In: the International Conference, 2017, Leipzig. Proceedings of the International Conference on Web Intelligence - WI '17. New York: ACM Press, 2017. p. 125.

  • ESTEVES, DIEGO ; MOUSSALLEM, DIEGO ; SORU, TOMMASO ; NETO, CIRO BARON ; LEHMANN, JENS ; NGOMO, AXEL-CYRILLE NGONGA ; DUARTE, JULIO CESAR . LOG4MEX. In: the International Conference, 2017, Leipzig. Proceedings of the International Conference on Web Intelligence - WI '17, 2017. p. 139.

  • ESTEVES, DIEGO ; MENDES, PABLO N. ; MOUSSALLEM, DIEGO ; DUARTE, JULIO CESAR ; ZAVERI, AMRAPALI ; LEHMANN, JENS . MEX Interfaces. In: the 12th International Conference, 2016, Leipzig. Proceedings of the 12th International Conference on Semantic Systems - SEMANTiCS 2016, 2016. p. 17.

  • MARX, EDGARD ; ZAVERI, AMRAPALI ; MOUSSALLEM, DIEGO ; RAUTENBERG, SANDRO . DBtrends. In: the 12th International Conference, 2016, Leipzig. Proceedings of the 12th International Conference on Semantic Systems - SEMANTiCS 2016, 2016. p. 9.

  • ESTEVES, DIEGO ; MOUSSALLEM, DIEGO ; NETO, CIRO BARON ; SORU, TOMMASO ; USBECK, RICARDO ; ACKERMANN, MARKUS ; LEHMANN, JENS . MEX vocabulary. In: the 11th International Conference, 2015, Vienna. Proceedings of the 11th International Conference on Semantic Systems - SEMANTICS '15. New York: ACM Press, 2015. p. 169.

  • SORU, TOMMASO ; MARX, EDGARD ; VALDESTILHAS, A. ; ESTEVES, DIEGO ; MOUSSALLEM, DIEGO ; PUBIO, G. . Neural Machine Translation for Query Construction and Composition. In: ICML workshop on Neural Abstract Machines & Program Induction v2 (NAMPI), 2018, Stockholm. International Conference on Machine Learning, 2018.

  • NETO, CIRO BARON ; SORU, TOMMASO ; ESTEVES, DIEGO ; MOUSSALLEM, DIEGO ; MARX, EDGARD ; VALDESTILHAS, A. . WASOTA: What are the states of the art?. In: Semantics' 16, 2016, Leipzig. 12th International Conference on Semantic Systems, 2016.

  • ESTEVES, DIEGO ; MOUSSALLEM, DIEGO ; LEHMANN, JENS . Interoperable Machine Learning Metadata using MEX. In: International Semantic Web Conference, 2015, Bethlehem, Pensylvania. The 14th International Semantic Web Conference, 2015.

Histórico profissional

Endereço profissional

  • Universität Paderborn, Heinz Nixdorf Institut, Data Science. , TP6.3.109 Technologiepark 6, Downtown, 33100 - Paderborn, - Alemanha, Telefone: (49) 5251605385, URL da Homepage:

Experiência profissional

2019 - Atual

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior

Vínculo: Celetista, Enquadramento Funcional: Senior Research Assistant, Regime: Dedicação exclusiva.

2015 - 2019

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior

Vínculo: Bolsista, Enquadramento Funcional: Pesquisador CNPq - GDE, Carga horária: 40, Regime: Dedicação exclusiva.

2012 - 2014

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior

Vínculo: Bolsista, Enquadramento Funcional: Pesquisador, Carga horária: 40, Regime: Dedicação exclusiva.

Outras informações:
Atuação em período integral no Instituto Militar de Engenharia (IME-RJ), realizando pesquisas no curso de mestrado de sistemas e computação nas área de Web Semântica, Maquinas de Tradução e Processamento de Linguagem Natural.