Lead, Acceleration Kernel Developer
Tenstorrent
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
As the Acceleration Kernel Developer Lead at Tenstorrent, you will take on a pivotal role in guiding the optimization of low-level workloads, kernel development, and enhancing the performance of our software for machine learning applications. You will lead a team of highly skilled engineers, ensuring our software operates at peak efficiency and delivers high-quality results to our clients and users.
This role is on-site, based out of Bangalore, India.
Responsibilities:
- Kernel Development Leadership: Lead the design, development, and maintenance of kernel-level software components for our applications. Develop and optimize kernels and kernel libraries for efficient machine learning and HPC applications.
- Implementation of Tensor Compute and Data Movement Optimizations: Lead the implementation and optimization of tensor compute and tensor data movement kernels.
- Optimization Focus: Drive efforts in analyzing and optimizing low-level code to improve the performance and efficiency of our software, with a strong emphasis on tensor optimization.
- Machine Learning Integration: Collaborate with machine learning engineers and data scientists to integrate optimized kernels and low-level routines into machine learning frameworks and pipelines.
- Performance Profiling and Strategy Development: Identify performance bottlenecks, conduct performance profiling, and develop strategies to address and resolve them.
- Testing and Debugging: Oversee the creation of comprehensive unit tests, conduct thorough debugging, and ensure the stability and reliability of kernel-level code.
- Documentation and Best Practices: Create and maintain clear and concise documentation for code, APIs, and best practices to facilitate team collaboration.
- Research and Innovation: Stay abreast of the latest developments in kernel development, tensor optimization, and machine learning to propose and implement innovative solutions and improvements.
Experience & Qualifications:
- Bachelor’s degree in Computer Science, Software Engineering, or a related field.
- Proven experience in kernel development, with a strong focus on low-level optimizations and tensor optimization.
- Proficiency in C/C++ programming languages.
- Familiarity with machine learning frameworks and concepts.
- Strong problem-solving skills and the ability to analyze and debug complex issues.
- Experience with performance profiling and optimization tools.
- Excellent communication and leadership skills.
- Self-motivated, detail-oriented, and able to work independently as well as lead a team.
- Experience with GPU programming (CUDA, OpenCL) is a plus.
- Knowledge of operating system internals is a plus.
Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.
Due to U.S. Export Control laws and regulations, Tenstorrent is required to ensure compliance with licensing regulations when transferring technology to nationals of certain countries that have been licensing conditions set by the U.S. government.
As this position will have direct and/or indirect access to information, systems, or technologies that are subject to U.S. Export Control laws and regulations, please note that citizenship/permanent residency, asylee and refugee information and supporting documentation will be required and considered as a condition of employment.
If a U.S. export license is required, employment will not begin until a license with acceptable conditions is granted by the U.S. government. If a U.S. export license with acceptable conditions is not granted by the U.S. government, then the offer of employment will be rescinded.