Coder Social home page Coder Social logo

Dr. Tatiana Likhomanenko

Research scientist and software developer.
Semi-supervised and unsupervised learning, speech recognition.
Gravitating to core ML, video processing, and private federated learning.

New: I am looking for interns with hands-on experience in multimodal generative models for 2024 year, please email me directly with your resume.

Github Google Scholar

Industry and Research Experience
  • Apple, Staff Research Scientist (Oct 2023 - present)
  • Apple, Senior Research Scientist (Sep 2021 - Oct 2023)
  • Fundamental AI Research, Postdoctoral Researcher (Aug 2019 - Aug 2021)
    Speech recognition and natural language processing for speech
    Advisors: Ronan Collobert, Gabriel Synnaeve
  • Fundamental AI Research, AI Resident (Sep 2018 - Aug 2019)
    Speech recognition and natural language processing for speech
    Advisors: Ronan Collobert, Gabriel Synnaeve
  • NTechLab, Machine Learning Expert (Aug 2017 - Sep 2018)
    Face recognition and facial attributes predictions with deep learning at top-1 face recognition team
  • Yandex & CERN, Researcher (Apr 2013 - May 2017)
    Machine learning for High Energy Physics studies at the Large Hadron Collider: particle identification system, trigger system (online identification which collisions worth being stored), specific rare decays search (high-level data analysis), and B mesons oscillations (main subject of the LHCb studies)
  • Membership at Large Hadron Collider beauty (LHCb) collaboration, CERN (2013 - 2018)
Education
Software
  • mlx-data: framework agnostic data loading library brought to you by Apple machine learning research; it works with PyTorch, Jax or MLX
  • Flashlight: a fast, flexible machine learning library written entirely in C++
    blog post
  • Wav2letter++: speech recognition toolkit and recipes for papers
  • BDT reweigter tutorial
  • HepML: specific machine learning tools for purposes of high energy physics
  • REP: ipython-based environment for conducting data-driven research in a consistent and reproducible way
Public Talks
Selected Publications

Private Federated Learning

  • Pelikan*, M., Azam, S.S., Feldman, V., Silovsky, J., Talwar, K. and Likhomanenko*, T. Federated Learning with Differential Privacy for End-to-End Speech Recognition, 2023. arXiv preprint arXiv:2310.00098. Under review.
  • Azam*, S.S., Pelikan*, M., Feldman, V., Talwar, K., Silovsky, J. and Likhomanenko*, T. Federated Learning for Speech Recognition: Revisiting Current Trends Towards Large-Scale ASR. In International Workshop on Federated Learning in the Age of Foundation Models in Conjunction with NeurIPS 2023. Oral
    overview, video, slides, poster
  • Azam, S.S., Likhomanenko, T., Pelikan, M. and Silovsky, J. Importance of Smoothness Induced by Optimizers in FL4ASR: Towards Understanding Federated Learning for End-to-End ASR, ASRU 2023.

Machine Learning

  • Busbridge*, D., Ramapuram*, J., Ablin*, P., Likhomanenko*, T., Dhekane, E.G., Suau, X. and Webb, R. How to Scale Your EMA. Thirty-Seventh Conference on Neural Information Processing Systems (NeurIPS), 2023. Spotlight.
    overview, video, slides, poster
  • Zhai*, S., Likhomanenko*, T., Littwin*, E., Busbridge*, D., Ramapuram*, J., Zhang, Y., Gu, J. and Susskind, J. Stabilizing Transformer Training by Preventing Attention Entropy Collapse. In International Conference on Machine Learning (ICML), 2023.
    overview, video, poster, code
  • Gheini, M., Likhomanenko, T., Sperber, M. and Setiawan, H. Joint Speech Transcription and Translation: Pseudo-Labeling with Out-of-Distribution Data. ACL Findings, 2023.
    overview
  • Zhai, S., Jaitly, N., Ramapuram, J., Busbridge, D., Likhomanenko, T., Cheng, J.Y., Talbott, W., Huang, C., Goh, H. and Susskind, J.M. Position Prediction as an Effective Pretraining Strategy. In International Conference on Machine Learning (ICML), 2022, pp. 26010-26027. PMLR. (Spotlight)
    overview, video, poster
  • Kahn, J.D., Pratap, V., Likhomanenko, T., Xu, Q., Hannun, A., Cai, J., Tomasello, P., Lee, A., Grave, E., Avidov, G., Steiner, B., Liptchinsky, V., Synnaeve, G., Collobert, R. Flashlight: Enabling Innovation in Tools for Machine Learning. In International Conference on Machine Learning (ICML), 2022, pp. 10557-10574. PMLR. (Spotlight)
    video, presentation, poster, code
  • Likhomanenko, T., Xu, Q., Synnaeve, G., Collobert, R. and Rogozhnikov, A. CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings. Thirty-Fifth Conference on Neural Information Processing Systems (NeurIPS), 2021.
    openreview, video, presentation, code
  • Rogozhnikov, A., Likhomanenko, T. InfiniteBoost: building infinite ensembles with gradient descent. arXiv preprint arXiv:1706.01109. 2017.

Automatic Speech Recognition

2023

  • Rouditchenko, A., Collobert, R. and Likhomanenko, T., AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition, 2023. arXiv preprint arXiv:2309.17395. Under review.
  • Likhomanenko, T., Lugosch, L. and Collobert, R. Unsupervised ASR via Cross-Lingual Pseudo-Labeling, 2023. arXiv preprint arXiv:2305.13330. Under review.
  • Berrebbi, D., Collobert, R., Jaitly, N., Likhomanenko, T. More Speaking or More Speakers?. ICASSP 2023.
    overview
  • Berrebbi, D., Collobert, R., Bengio, S., Jaitly, N., Likhomanenko, T. Continuous Pseudo-Labeling from the Start. ICLR 2023.
    overview, video, slides, poster
2022

  • Likhomanenko, T., Collobert, R., Jaitly, N., Bengio, S. Continuous Soft Pseudo-Labeling in ASR. I Can’t Believe It’s Not Better Workshop at NeurIPS 2022.
    video, poster
  • Lugosch, L., Likhomanenko, T., Synnaeve, G. and Collobert, R. Pseudo-Labeling for Massively Multilingual Speech Recognition. ICASSP 2022.
    blog post, code
  • Pratap, V., Xu, Q., Likhomanenko, T., Synnaeve, G. and Collobert, R. Word Order Does Not Matter For Speech Recognition. ICASSP 2022.
2021

  • Manohar, V., Likhomanenko, T., Xu, Q., Hsu, W.N., Collobert, R., Saraf, Y., Zweig, G. and Mohamed, A., 2021. Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognition. ASRU 2021.
  • Likhomanenko, T., Xu, Q., Kahn, J., Synnaeve, G. and Collobert, R. slimIPL: Language-model-free iterative pseudo-labeling. Interspeech 2021.
    video, poster, code
  • Likhomanenko*, T., Xu*, Q., Pratap*, V., Tomasello, P., Kahn, J., Avidov, G., Collobert, R. and Synnaeve, G. Rethinking evaluation in asr: Are our models robust enough? Interspeech 2021.
    video, poster, code
  • Hsu, W.N., Sriram, A., Baevski, A., Likhomanenko, T., Xu, Q., Pratap, V., Kahn, J., Lee, A., Collobert, R., Synnaeve, G. and Auli, M., 2021. Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training. Interspeech 2021.
  • Xu, Q., Baevski, A., Likhomanenko, T., Tomasello, P., Conneau, A., Collobert, R., Synnaeve, G. and Auli, M., 2021, June. Self-training and pre-training are complementary for speech recognition. In ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 3030-3034). IEEE.
    video
  • Talnikar, C., Likhomanenko, T., Collobert, R. and Synnaeve, G., 2021, June. Joint masked cpc and ctc training for asr. In ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 3045-3049). IEEE.
    video, poster, presentation
2020

  • Xu, Q., Likhomanenko, T., Kahn, J., Hannun, A., Synnaeve, G. and Collobert, R., 2020. Iterative Pseudo-Labeling for Speech Recognition. Proc. Interspeech 2020, pp.1006-1010.
    video, code
  • Pratap, V., Xu, Q., Kahn, J., Avidov, G., Likhomanenko, T., Hannun, A., Liptchinsky, V., Synnaeve, G., Collobert, R. (2020) Scaling Up Online Speech Recognition Using ConvNets. Proc. Interspeech 2020, 3376-3380.
    video, blog post, news
  • Kahn, J., Rivière, M., Zheng, W., Kharitonov, E., Xu, Q., Mazaré, P.E., Karadayi, J., Liptchinsky, V., Collobert, R., Fuegen, C. and Likhomanenko, T., 2020, May. Libri-light: A benchmark for asr with limited or no supervision. In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 7669-7673). IEEE.
    presentation, blog post, code
  • Synnaeve*, G., Xu*, Q., Kahn*, J., Likhomanenko*, T., Grave*, E., Pratap, V., Sriram, A., Liptchinsky, V. and Collobert, R. End-to-end asr: from supervised to semi-supervised learning with modern architectures. SAS Workshop ICML 2020.
    video, code
2019

  • Likhomanenko, T., Synnaeve, G. and Collobert, R., 2019. Who Needs Words? Lexicon-Free Speech Recognition. Proc. Interspeech 2019, pp.3915-3919.
    presentation, blog post, code

Machine Learning in High Energy Physics

  • Derkach, D., Hushchyn, M., Likhomanenko, T., Rogozhnikov, A., Kazeev, N., Chekalina, V., Neychev, R., Kirillov, S., Ratnikov, F. and LHCb collaboration. Machine-Learning-based global particle-identifiritcation algohms at the LHCb experiment. Journal of Physics: Conference Series. 2018. Vol. 1085. No. 4. P. 1-5.
    ACAT 2017, poster
  • Likhomanenko, T., Derkach, D., Rogozhnikov, A. Inclusive Flavour Tagging Algorithm. Journal of Physics: Conference Series, 2016.
    ACAT 2016, poster, code
  • LHCb collaboration (2016). Search for decays of neutral beauty mesons into four muons, JHEP 03 (2017) 001.
  • Likhomanenko, T., Ilten, P., Khairullin, E., Rogozhnikov, A., Ustyuzhanin, A., Williams, M. LHCb Topological Trigger Reoptimization. Journal of Physics: Conference Series, 2015.
    CHEP 2015, presentation, code
  • CMS collaboration, LHCb collaboration. Observation of the rare Bs0→ μ+ μ− decay from the combined analysis of CMS and LHCb data. Nature, 2015.
  • Likhomanenko, T., Rogozhnikov, A., Baranov, A., Khairullin, E., & Ustyuzhanin, A. Reproducible Experiment Platform. Journal of Physics: Conference Series (Vol. 664, No. 5, p. 052022).
    CHEP 2015, poster
  • LHCb collaboration. Search for the lepton flavour violating decay τ−→ μ− μ+ μ−. Journal of High Energy Physics, 2015.
  • Likhomanenko, T., Rogozhnikov, A., Baranov, A., Khairullin, E., Ustyuzhanin, A. Improving reproducibility of data science experiments, ICML 2015 AutoML Workshop, 2015
    poster spotlight

Partial Differential Equations (Ph.D.)

  • Moiseev, E.I., Likhomanenko, T.N. Eigenfunctions of the Gellerstedt problem with an inclined-type change line. Integral Transforms and Special Functions, 2017, pp. 1–8.
  • Moiseev E. I., Likhomanenko T. N. On the basis property of a two-part trigonometric series. Doklady Mathematics, 2016, Vol. 94, No. 1, pp. 1–4.
    oral talk, International scientific conference Actual Problems in Theory of Partial Differential Equations, dedicated to the centenary of Andrey V. Bitsadze, 2016
  • Moiseev, E.I., Likhomanenko, T.N. Eigenfunctions of the Tricomi problem with an inclined type change line. Differential Equations, 2016, Vol. 52, No. 10, pp 1323– 1330.
    oral talk, International scientific conference Actual Problems in Theory of Partial Differential Equations, dedicated to the centenary of Andrey V. Bitsadze, 2016
  • Moiseev, E.I., Likhomanenko, T.N. On the basis property of a trigonometric system arising in the Frankl problem. Differential Equations, 2013, Vol. 49, No. 3, pp. 325–331.
    oral talk, AMEE-2013 and Lomonosov-2013
  • Moiseev E.I., Likhomanenko T.N. A nonlocal boundary value problem for the Lavrent’ev-Bitsadze equation. Doklady Mathematics, 2012, Vol. 86, No. 2, pp. 635–637.
    oral talk, AMEE-2012 and Lomonosov-2012
Teaching
Research Activities

Serving as Reviewer

  • Transactions on Machine Learning Research (TMLR)
  • Journal of Artificial Intelligence Research
  • NeurIPS 2021, 2022 (top-8% reviewer), 2023 (top-8% reviewer)
  • ICLR 2021, 2022 (highlighted reviewer), 2023, 2024
  • ICLR Blogposts 2023, 2024
  • ICML 2022, 2023
  • Interspeech 2020, 2021, 2022, 2023 (top-2% reviewer)
  • ICASSP 2021, 2022, 2023 (outstanding reviewer), 2024
  • Machine Learning and the Physical Sciences workshop NeurIPS 2019, 2020, 2022, 2023
  • SynS and ML Workshop ICML 2023
  • Vision-based InduStrial InspectiON (VISION) Workshop CVPR 2023
  • CHIME 2023
  • BayLearn 2022, 2023
  • An advisor in the LHCb statistics and machine learning working group (2016-2017)

Serving as Area Chair

  • NeurIPS Datasets and Benchmarks 2023
  • Vision-based InduStrial InspectiON (VISION) Workshop CVPR 2023

Mentorship

  • WiML, Research Mentorship, NeurIPS, New Orleans (2023)
  • LatinX in AI, Mentorship Hour (Panel), ICML, Honolulu (2023)
  • LatinX in AI, CV Research workshop, CVPR, New Orlean (2022)

Panels

  • Failure Modes in the Age of Foundation Models, workshop "I Can’t Believe It’s Not Better (ICBINB): Failure Modes in the Age of Foundation Models", NeurIPS, New Orleans (2023)
  • Mentorship Hour, LatinX in AI, ICML, Honolulu (2023)
  • On-Device Workshop MLSys, Miami (2023)

Organizer

Kaggle Competition "Flavours of Physics"

Advising

  • Zijin Gu, AI/ML Resident, Apple 2023-2024 (co-advising with Navdeep Jaitly)
  • Andrew Rouditchenko, summer internship, Apple, 2023
  • Lingxiao Zhao, summer internship, Apple, 2023 (co-advising)
  • Chun-wei Ho, summer internship, Apple, 2023 (co-advising with Navdeep Jaitly and Ronan Collobert)
  • Sheikh Shams Azam, AI/ML Resident, Apple 2022-2023 (co-advising with Honza Silovsky)
  • Dan Berrebbi, summer internship, Apple, 2022
  • Mozhdeh Gheini, summer internship, Apple, 2022 (co-advising with Matthias Sperber and Hendra Setiawan); Apple, 2023
  • Colby Bunbary, summer internship, Apple, 2022 (co-advising)
  • Loren Lugosch: summer internship, Facebook AI Reserch, 2021 (co-advising with Ronan Collobert and Gabriel Synnaeve); summer internship, Apple (co-advising with Ronan Collobert), 2022
  • Chaitanya Talnikar, AI Residency 2019-2020 (co-advising with Ronan Collobert and Gabriel Synnaeve)
In News
Honors & Awards
  • Winner of Accelerate your code international competition, Intel (2012)
  • Best student of Computer Science faculty, Lomonosov Moscow State University (2012)
  • The winner (Regional stage) of All-Russian Programming contest (2007, 2008)

Tatiana Likhomanenko's Projects

detectron icon detectron

FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

fairseq icon fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

hep_ml icon hep_ml

Machine learning algorithms for high energy physics.

lightgbm icon lightgbm

A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. It is under the umbrella of the DMTK(http://github.com/microsoft/dmtk) project of Microsoft.

mlatgraddays icon mlatgraddays

Machine Learning session on Grad Days at Heidelberg university 2017

mlatimperial2017 icon mlatimperial2017

Materials for the course of machine learning at Imperial College organized by YSDA

mlx-data icon mlx-data

Efficient framework-agnostic data loading

rep icon rep

Machine Learning toolbox for Humans

wav2letter icon wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.