(Staff/ Sr. Staff) Machine Learning Engineer

Omnivision Technologies Singapore · Singapore

Sector
AI
Function
Product & Engineering
Level
Mid-Level
Employment type
Full Time
Posted
2026-06-19
Source
mycareersfuture

Description:Lead applied research and strategic definition of machine learning algorithms, quantization methodologies, and toolchain capabilities for the Neural Network Development Kit (NDK) roadmap targeting next-generation Edge AI compute solutionsDrive innovation at the intersection of ML algorithms and constrained hardware environments, identifying and validating the latest Edge AI technologies applicable to product requirementsServe as the primary technical interface between the AI Architecture team and the Software (Edge AI) team, delivering well-researched toolchain feature proposals and algorithmic specifications for implementationCollaborate with Software (Edge AI) and AI Architecture teams to identify and pursue targeted improvements in ML software methodology, and support Software-initiated improvement efforts with algorithmic insight and implementation guidanceMaintain deep engagement with the global Edge AI research community to ensure the NDK roadmap reflects the state of the art in model efficiency, compression, and on-device learningKey Responsibilities:Collaborate with the Sr. Design Manager (AI Architect) to define and maintain the NDK toolchain feature roadmap, ensuring alignment with the NPU hardware roadmap and overall AI product strategyResearch, evaluate, and recommend quantization algorithms, pruning strategies, knowledge distillation techniques, and other model compression methodologies suited to constrained hardware targetsAssemble and lead focused task forces drawing on partial bandwidth from the Software (Edge AI) and IC Design teams to prototype, benchmark, and validate proposed toolchain concepts before broader commitmentPrototype and benchmark candidate ML algorithms and toolchain features to quantitatively demonstrate accuracy-performance trade-offs and justify roadmap prioritizationTranslate hardware architectural capabilities and constraints (as defined by the NPU Architect) into concrete toolchain feature requirements and algorithmic optimization opportunitiesDeliver comprehensive technical specifications and algorithmic documentation to the Software (Edge AI) team to enable confident and accurate implementation of NDK featuresCollaborate closely with the Software (Edge AI) team throughout the implementation phase to resolve algorithmic questions, validate correctness of implementations, and ensure performance targets are metActively monitor and synthesize developments from the Edge AI research community — including publications, open-source frameworks, and industry benchmarks — to continuously inform and refresh the NDK roadmapPartner with the Software (Edge AI) team to jointly identify ML toolchain methodology improvement opportunities and drive those that originate from the AI Architecture team; provide expert advisory support for methodology improvements initiated by the Software teamEvaluate and apply a range of productivity tools and techniques — including but not limited to AI-assisted methods — to accelerate algorithmic prototyping, benchmarking, and specification productivityEvaluate and integrate relevant open-source ML frameworks, runtimes, and toolchain components (e.g., MLIR, TVM, ONNX Runtime) as acceleration vectors for NDK developmentRequirements:Bachelor's or Master's degree in Computer Science, Electrical Engineering, Computer Engineering, or related technical field; PhD preferred, particularly in machine learning, optimization, or computer architecture8+ years of experience in applied machine learning engineering or Edge AI software, with at least 5 years focused on model optimization, ML compilers, or on-device inference toolchain developmentProven expertise in quantization (PTQ, QAT), pruning, knowledge distillation, and other model compression techniques with demonstrated results on resource-constrained hardwareStrong knowledge of AI/ML algorithms, neural network architectures (CNNs, RNNs, Transformers, etc.), and the trade-offs between model accuracy, computational complexity, and memory footprintDemonstrated ability to stay at the forefront of the Edge AI research community, with a track record of translating academic and industry advances into practical product roadmap contributionsHands-on experience with mainstream ML frameworks (PyTorch, TensorFlow/Lite) and familiarity with ML compiler stacks such as MLIR, TVM, or ONNX RuntimeExperience consuming hardware architectural specifications and translating them into software toolchain requirements and algorithmic optimizationsExcellent communication skills with ability to present complex research findings and toolchain proposals clearly to architecture, software, and executive audiencesStrong analytical and problem-solving abilities with emphasis on quantitative benchmarking, accuracy-efficiency trade-off analysis, and performance profiling on target hardwareDemonstratedability to work collaboratively across team boundaries, including assembling and coordinating cross-functional task forces without direct authorityFamiliarity with RISC-V ISA and its software ecosystem, particularly in the context of AI inference deploymentExperience with FPGA-based or simulator-based prototyping to validate algorithmic concepts against pre-silicon hardware models (preferred but not required)Self-motivated with ability to work independently, lead applied research initiatives, and drive toolchain innovation from algorithmic exploration through specification and successful team handoff

Apply on mycareersfuture →
AI Computer Engineering Compiler Android NDK Computer Science Adobe Edge Animate hardware acceleration Providing expert advice