I am a Postdoc at The University of Edinburgh's Institute for Language, Cognition and Computation, working with Kenneth Heafield. I completed my PhD in the same institute with Adam Lopez and Kenneth Heafield.
I like languages, logographic writing systems, game theory and GPUs. I (try to) make things run faster and enjoy (premature) optimization. In my spare time I learn languages and play football. Please check out my blog where I post random things about Chinese characters, code and life.
My research lies at the intersection of high-performance computing and NLP. I am interested in GPGPU, low-level CPU GEMM routines and NLP, in particular Neural Machine Translation, transformers, word embeddings, transfer learning and multilinguality. Lately, I have been working on optimising Neural Machine Translation inference on both CPUs and GPUs.
I am an avid supporter of and believer in Open Source software and have contributed to various projects, including:
- Various contributions to the marian machine translation framework.
- translateLocally, a cross-platform desktop application for offline machine translation.
- OpusCleaner and OpusTrainer, a modern machine translation data cleaner and trainer, built together with Jelmer van der Linde.
- Privacy-focussed machine translation with Firefox.
- gemmBench, a benchmark framework for various low-precision GEMM frameworks.
- Collaborator on intgemm, an 8- and 16-bit integer GEMM framework by Kenneth Heafield.
- bfTile, an experimental VNNI GEMM tiling.
- imageSelector, a cross-platform photo library organiser.
- gLM, a GPU n-gram language model.
- ProbingPT, a probing phrase table for statistical machine translation.
Laurie Burchell, Alexandra Birch, Nikolay Bogoychev, Kenneth Heafield An Open Dataset and Model for Language Identification In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics, Toronto, Canada, 2023 [pdf] [bib]
Ramon Sanabria, Nikolay Bogoychev, Nina Markl, Andrea Carmantini, Ondrej Klejch, Peter Bell The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, 2023 [pdf] [bib]
Mikko Aulamo, Nikolay Bogoychev, Shaoxiong Ji, Graeme Nail, Gema Ramírez-Sánchez, Jörg Tiedemann, Jelmer van der Linde, Jaume Zaragoza HPLT: High Performance Language Technologies In Proceedings of the 24th Annual Conference of the European Association for Machine Translation, Tampere, Finland, 2023 [pdf] [bib]
Nikolay Bogoychev, Biao Zhang, Maximiliana Behnke, Graeme Nail, Jelmer van der Linde, Sidharth Kashyap, Kenneth Heafield Edinburgh’s Submission to the WMT 2022 Efficiency Task In Proceedings of the Seventh Conference on Machine Translation (WMT), Abu Dhabi, United Arab Emirates [pdf] [bib] [poster]
Kenneth Heafield, Biao Zhang, Graeme Nail, Jelmer van der Linde, Nikolay Bogoychev Findings of the WMT 2022 Shared Task on Efficient Translation In Proceedings of the Seventh Conference on Machine Translation (WMT), Abu Dhabi, United Arab Emirates [pdf] [bib] [presentation]
Andreas Grivas, Nikolay Bogoychev, Adam Lopez Low-Rank Softmax Can Have Unargmaxable Classes in Theory but Rarely in Practice In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, Dublin, Ireland [pdf] [bib] [code] [demo]
Nikolay Bogoychev Not all parameters are born equal: Attention is mostly what you need In Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, Punta Cana, Dominican Republic [pdf] [bib] [blog] [poster]
Nikolay Bogoychev and Pinzhen Chen The Highs and Lows of Simple Lexical Domain Adaptation Approaches for Neural Machine Translation In Proceedings of the Second Workshop on Insights from Negative Results in NLP, Punta Cana, Dominican Republic [pdf] [bib] [blog] [presentation] [slides]
Nikolay Bogoychev, Jelmer van der Linde, Kenneth Heafield TranslateLocally: Blazing-fast translation running on the local CPU In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Punta Cana, Dominican Republic [pdf] [bib] [blog] [demo]
Maximiliana Behnke, Nikolay Bogoychev, Alham Fikri Aji, Kenneth Heafield, Graeme Nail, Qianqian Zhu, Svetlana Tchistiakova, Jelmer van der Linde, Pinzhen Chen, Sidharth Kashyap and Roman Grundkiewicz Efficient Machine Translation with Model Pruning and Quantization In Proceedings of the Sixth Conference on Machine Translation (WMT21), Punta Cana, Dominican Republic [pdf] [bib]
Pinzhen Chen, Jindřich Helcl, Ulrich Germann, Laurie Burchell, Nikolay Bogoychev, Antonio Valerio Miceli Barone, Jonas Waldendorf, Alexandra Birch and Kenneth Heafield The University of Edinburgh’s English-German and English-Hausa Submissions to the WMT21 News Translation Task In Proceedings of the Sixth Conference on Machine Translation (WMT21), Punta Cana, Dominican Republic [pdf] [bib]
Pinzhen Chen, Nikolay Bogoychev, Ulrich Germann Character Mapping and Ad-hoc Adaptation: Edinburgh’s IWSLT 2020 Open Domain Translation System. In Proceedings of the 17th International Conference on Spoken Language Translation, Seattle, USA [pdf] [bib]
Nikolay Bogoychev, Roman Grundkiewicz, Alham Fikri Aji, Maximiliana Behnke, Kenneth Heafield, Sidharth Kashyap, Emmanouil-Ioannis Farsarakis, Mateusz Chudyk Edinburgh’s Submissions to the 2020 Machine Translation Efficiency Task. In Proceedings of the Fourth Workshop on Neural Generation and Translation, Seattle, USA [pdf] [bib]
Pinzhen Chen, Nikolay Bogoychev, Kenneth Heafield, Faheem Kirefu Parallel Sentence Mining by Constrained Decoding. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Seattle, USA [pdf] [bib]
Alham Fikri Aji, Nikolay Bogoychev, Kenneth Heafield In Neural Machine Translation, What Does Transfer Learning Transfer? In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Seattle, USA [pdf] [bib]
Young Jin Kim, Marcin Junczys-Dowmunt, Hany Hassan, Alham Fikri Aji, Kenneth Heafield, Roman Grundkiewicz, Nikolay Bogoychev From Research to Production and Back: Ludicrously Fast Neural Machine Translation. In Proceedings of the 3rd Workshop on Neural Generation and Translation, Hong Kong. [pdf] [bib]
Rachel Bawden, Nikolay Bogoychev, Ulrich Germann, Roman Grundkiewicz, Faheem Kirefu, Antonio Valerio Miceli Barone, Alexandra Birch The University of Edinburgh's Submissions to the WMT19 News Translation Task. In Proceedings of the Fourth Conference on Machine Translation, Florence, Italy [pdf] [bib]
Lushi Chen, Abeer Aldayel, Nikolay Bogoychev, Tao Gong Similar Minds Post Alike: Assessment of Suicide Risk Using a Hybrid Model. In Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology, Minneapolis, Minnesota, USA. [pdf] [bib]
Nikolay Bogoychev, Marcin Junczys-Dowmunt, Kenneth Heafield, Alham Fikri Aji Accelerating Asynchronous Stochastic Gradient Descent for Neural Machine Translation. In Proceedings of EMNLP, Brussels, Belgium. [pdf] [bib]
Barry Haddow, Nikolay Bogoychev, Denis Emelin, Ulrich Germann, Roman Grundkiewicz, Kenneth Heafield, Antonio Valerio Miceli Barone, Rico Sennrich The University of Edinburgh's Submissions to the WMT18 News Translation Task In Proceedings of the Third Conference on Machine Translation, Brussels, Belgium. [pdf] [bib]
Marcin Junczys-Dowmunt, Roman Grundkiewicz, Tomasz Dwojak, Hieu Hoang, Kenneth Heafield, Tom Neckermann, Frank Seide, Ulrich Germann, Alham Fikri Aji, Nikolay Bogoychev, André F. T. Martins, Alexandra Birch (2018). Marian: Fast Neural Machine Translation in C++. Proceedings of ACL, Sydney, Australia, System Demonstrations. [pdf] [bib]
Barry Haddow, Matthias Huck, Alexandra Birch, Nikolay Bogoychev, Philipp Koehn The Edinburgh/JHU Phrase-based Machine Translation Systems for WMT 2015. In Proceedings of WMT, Lisboa, Portugal. [pdf] [bib]
Alexandra Birch, Matthias Huck, Nadir Durrani, Nikolay Bogoychev, Philipp Koehn Edinburgh SLT and MT System Description for the IWSLT 2014 Evaluation. In Proceedings of IWSLT, Lake Tahoe, USA. [pdf] [bib]
2014: Received a First-class Bachelor's degree in Artificial Intelligence & Computer Science from The University of Edinburgh