Skip to content
View cataluna84's full-sized avatar
:octocat:
Founder & CEO
:octocat:
Founder & CEO

Organizations

@Cohere-Labs-Community

Block or report cataluna84

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
cataluna84/README.md

Hi, I'm Mayank Bhaskar πŸ‘‹

Founder & CEO β€” OM Enterprises Β· AI Research & Engineering Labs Senior ML Engineer Β· AI Researcher Β· 14+ years building production AI systems

🌐 Portfolio: cataluna84.github.io β€” full project narrative, ARC-AGI work, Kaggle competitions, experience timeline πŸ“‚ Portfolio source: cataluna84/portfolio

Portfolio LinkedIn Kaggle HuggingFace X


πŸ”­ What I'm doing now

Building ubiquitous, interactive intelligence that runs on multimodal data (audio, text, image, video) with high throughput and low latency.

My research spans Computer Vision, State Space Models (S4 / S5 / Mamba), Multimodal Models, Reinforcement Learning, Mechanistic Interpretability, and LLM compression β€” with applied experience deploying on-device (iOS), on edge (DeepStream SDK), and at scale (CUDA/Triton).


πŸ† Highlights

πŸ₯ˆ ARC Prize 2024 24th / 1,427 teams β€” Silver Medal
πŸ₯‰ ARC Prize 2025 87th / 1,455 teams β€” solo Bronze Medal
πŸ”„ ARC Prize 2026 Active competitor
πŸ₯‰ HF Flax/JAX Sprint 3rd place β€” CLIP-RSICD (satellite images)
πŸ“Š Kaggle Competitions Expert Β· rank ~1,532 / 202,876 Β· 1 Silver + 3 Bronze across 21+ competitions
πŸ€— HuggingFace 14 models published (NER, text generation, image generation)

πŸš€ Featured projects

πŸ”¬ LID β€” Layer-Wise Multilingual Language ID β€” Layer-wise dynamics of multilingual language identification across 67 languages in compact foundation models. PyTorch Β· LoRA Β· Macro F1 0.97+

🧠 ARC Prize 2026 (ARC-AGI-3) β€” Public lab notebook for the $850K ARC-AGI-3 competition. Agent zoo, FORGE, BFS, CNN, frame segmenter.

🎡 Codec Fine-tuning β€” Tiny Aya β€” Benchmark for fine-tuning neural speech codecs (Mimi, DualCodec, Kanade) on low-resource languages with 8 optimizers + W&B Bayesian sweeps.

πŸ›‘οΈ Crosslingual Emergent Misalignment β€” Cohere Labs research on how safety guardrails degrade across high- and low-resource languages.

🎨 Generative Deep Learning β€” VAE, GAN, and Diffusion implementations for image synthesis.

🐍 Jamba β€” Hybrid Transformer–Mamba β€” PyTorch implementation exploring SSM + attention for infinite-context LMs.

🌍 World Models (Ha & Schmidhuber) β€” VAE + MDN-RNN + CMA-ES for RL in the OpenAI Car Racing environment.

β†’ Full project list with metrics, write-ups, and demos: cataluna84.github.io


🀝 Open source contributions

  • Google DeepMind β€” penzai (JAX neural network library)
  • TransformerLens β€” TransformerLens (mechanistic interpretability)
  • Kyutai Labs β€” moshi (open-source speech-text foundation model)

πŸ› οΈ Tech stack

Deep Learning & Research PyTorch JAX Mamba/S4/S5 Transformers CUDA Triton

ML Engineering & MLOps W&B HuggingFace Docker Quantization FlashAttention

Computer Vision & Edge DeepStream iOS CLIP

Languages & Tools Python C++ uv Linux Git


🌱 Community & leadership

  • Cohere Labs β€” Community Lead (Computer Vision sub-field) & Researcher
  • TWiML (This Week in ML) β€” Contributor Lead, 5+ years moderating weekly GenAI meetups
  • Lucknow AI Labs β€” Core organizer, Research Paper Team lead
  • Machine Learning Tokyo (MLT) β€” Contributor (2019–2023)
  • Yannic Kilcher Discord β€” Daily/weekly paper discussions

Conferences attended (remotely): ICLR Β· ICML Β· CVPR Β· NeurIPS


πŸ“ˆ GitHub stats

Mayank's GitHub stats Top Langs


πŸ“« Get in touch

Open to Senior ML Engineer Β· Research Engineer Β· CV Engineer Β· Senior SDE roles, research collaborations, and advisory work.

The portfolio site is the canonical source β€” this README is a GitHub-native summary that links back to it.

Pinned Loading

  1. arc-agi-3 arc-agi-3 Public

    Public lab notebook for the Kaggle ARC Prize 2026 - ARC-AGI-3 competition: agents, experiments, and tooling. Apache 2.0.

    Jupyter Notebook 2

  2. lid lid Public

    Layer-wise dynamics of multilingual language identification (67 languages) in compact foundation models β€” training, layer-wise inference, and optimization-strategy benchmarks.

    Jupyter Notebook 1

  3. rsk2327/Tiny-Aya-Under-the-hood rsk2327/Tiny-Aya-Under-the-hood Public

    This project investigates how Tiny Aya processes information across languages by analyzing how representations evolve across model layers.

    Jupyter Notebook 5 1

  4. Cohere-Labs-Community/crosslingual-emergent-misalignment Cohere-Labs-Community/crosslingual-emergent-misalignment Public

    A community research project investigating emergent misalignment in multilingual language models.

    Jupyter Notebook 5 2

  5. Generative_Deep_Learning Generative_Deep_Learning Public

    Generative Modeling

    Jupyter Notebook 3 1

  6. tiny-aya-simulatenous-translation/tinyaya-stage2-scale tiny-aya-simulatenous-translation/tinyaya-stage2-scale Public

    TinyAya Stage 2: TR↔HI speech-to-speech translation at scale

    Python 1 1