Skip to content
View Ashutosh0x's full-sized avatar
⚑
⚑

Block or report Ashutosh0x

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Ashutosh0x/README.md

Ashutosh Kumar Singh

Software and Security @ Skyhigh Security Β· ML Systems Researcher Β· NeurIPS 2026 Author


πŸ”¬ What I Do

I build secure, scalable systems at Skyhigh Security and contribute to the reliability of production ML infrastructure across Apple, Google, and OpenAI. My current research focus is numerical stability in half-precision inference on neural accelerators β€” work that led to a NeurIPS 2026 workshop paper submission and fixes deployed across Apple's entire ML compiler stack.


🍎 Featured: Apple ML Framework Contributions

4 PRs Β· 6 Issues Β· 3 Apple repos Β· 21 production models protected

I discovered a systematic class of fp16 overflow failures on Apple Neural Engine that silently produce incorrect outputs in operations like softplus, mish, and logsumexp. I derived mathematically equivalent, overflow-proof decompositions and submitted fixes across Apple's ML stack:

Repository Contribution Status
apple/coremltools PRs #2725, #2726, #2727 β€” Stable decompositions for 5 ops βœ… Approved & merge-ready
apple/coreai-torch PR #22 β€” First external fp16 fix on WWDC 2026 framework 🟑 Under review
apple/coreai-optimization Issue #7 β€” FP16 casting Γ— quantization compound vulnerability 🟑 Reported

Impact: Fixes protect all 21 Core AI production model recipes (Gemma3, Qwen3, Mistral, Whisper, YOLO, SAM3, Stable Diffusion, etc.) from silent numerical failures on 2.5B+ Apple Silicon devices.


πŸ† Open-Source Track Record

Project What I Did Impact
Google DeepMind (Chex) Modernized JAX sharding detection for JAX 0.8.x Merged by DeepMind maintainer
OpenAI Codex Fixed critical Windows TUI authentication bug Merged into official repo
Tenstorrent ($1,500 bounty) Ported Depth Anything V2 Large to TTNN (Wormhole B0) PCC 0.9983, bounty completed
Google ADK-Python Fixed path traversal / Zip Slip vulnerability (CWE-22) Merged β€” prevented RCE
TensorFlow Fixed 3 memory safety vulns (heap OOB, corruption, CHECK failures) Merged β€” prevented crashes in prod ML
Bazel Implemented TLS error fail-over in Repository Downloader Merged into official repo
Stripe Issue investigation and resolution in StripeJS Resolved

πŸ“„ Research

Compiler-Level Numerical Stabilization for Half-Precision Inference on Neural Accelerators NeurIPS 2026 Workshop Submission Β· Ashutosh Kumar Singh

Read Paper

Cross-framework audit of Apple's entire ML stack (5 frameworks Γ— 5 operations Γ— 21 models). Derived overflow-free algebraic decompositions validated on M3 Max and M5 silicon with zero regressions.


πŸ›  Tech Stack

python cplusplus javascript typescript go java react nextjs
aws gcp azure docker kubernetes terraform tensorflow deepmind

πŸŽ–οΈ Certifications

Cloud Architect
GCP Professional
Cloud Architect
Cloud Security
GCP Cloud
Security
Cloud Digital Leader
Cloud Digital
Leader
Azure AZ-900
Azure AZ-900
SC-900
Microsoft SC-900
GCP Fly Cup
GCP Fly Cup
LeetCode 50 Days
LeetCode 50

πŸ“Š Stats

stats graph languages graph

Open to opportunities in ML Systems, Numerical Computing, and Infrastructure
Let's connect β†’

Snake animation

Pinned Loading

  1. Ai-Trip-Planner Ai-Trip-Planner Public

    AI Trip Planner: Flutter app with Firebase (Auth, Firestore, Storage, Messaging), Stripe payments, Maps, voice, TTS, biometrics, and localization.

    Dart 2 1

  2. gemini-code-review-bot gemini-code-review-bot Public

    A fully automated AI code-review system that runs inside GitHub CI/CD, analyzes pull requests using Gemini, and posts precise, line-level review comments.

    JavaScript

  3. datapulse datapulse Public

    DataPulse: AI-Driven Autonomous Incident Response platform. Built with Elastic Agent Builder, ES|QL, and ELSER for high-fidelity RCA and remediation.

    JavaScript

  4. rust-finance rust-finance Public

    A high-performance, ultra low-latency trading terminal and AI-infused daemon built completely in Rust.

    Rust 366 117

  5. tunix-gemma-reasoning tunix-gemma-reasoning Public

    Training Gemma3-1B to produce structured reasoning traces using Tunix (SFT + GRPO) on Kaggle TPUs. Google Tunix Hackathon submission.

    Jupyter Notebook 1

  6. neuralpulse-wear neuralpulse-wear Public

    NeuralPulse: Advanced Wear OS 7 & Android 16 Health Ecosystem for Samsung Galaxy & Android devices. Connects with Wear OS smartwatches to stream real-time biometrics, apply edge-native digital sign…

    Kotlin 1