AI Specialist (AI Engineering)
Australia·Posted 1mo ago
web3 · python
<p>We are looking for an AI Specialist to improve the performance of large language and vision models for on-device inference. You will develop and deploy AI solutions that run efficiently across diverse hardware architectures.</p> <p><strong>Responsibilities:</strong></p> <ul> <li>Compress and optimize large language and vision models for on-device inference.</li> <li>Develop pipelines for model distillation and hardware-specific compilation.</li> <li>Benchmark performance across various NPU and GPU architectures.</li> </ul> <p><strong>Qualifications:</strong></p> <ul> <li>Expertise in model distillation, pruning, and 4-bit and 8-bit quantization techniques.</li> <li>Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.</li> <li>Strong C++ and Python skills.</li> </ul>