Workload Optimization Engineer

Tenstorrent | United States | Onsite

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists has developed a high-performance RISC-V CPU from scratch and shares a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
We are seeking a highly skilled CPU Workload Performance Optimization Engineer to drive the analysis, tuning, and optimization of CPU workloads for cutting-edge processor architectures. In this role, you will work closely with architects, hardware designers, and software engineers to analyze performance bottlenecks, improve efficiency, and influence future CPU designs. Your expertise will directly impact the performance of next-generation computing platforms across diverse workloads.

This role is remote, based out of the United States.

We welcome candidates at various experience levels for this role.

During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.

Responsibilities:


  • Conduct competitive analysis for latest CPU products with industry-standard and emerging benchmarks
  • Analyze and optimize CPU workloads, identifying performance bottlenecks and inefficiencies in microarchitecture and software interactions.
  • Reduce workloads for CPU performance modeling and performance correlation
  • Run performance models, EDA frameworks, and/or profiling tools to measure, characterize and predict CPU behavior under various workloads.
  • Collaborate with CPU architects and hardware designers to enhance microarchitectural features and overall processor efficiency.
  • Work closely with software engineers to optimize applications, compilers, supporting libraries, and operating systems for better CPU performance.
  • Drive key workload performance optimization on existing hardware by developing handwritten kernels.
  • Publish performance tuning guidelines and best practices for internal teams and external developers and customers.
  • Stay abreast of industry trends, new workload requirements, and advancements in CPU performance analysis and optimization techniques.

Experience & Qualifications:


  • Bachelor’s, Master’s, or PhD in Computer Engineering, Electrical Engineering, or a related field.
  • At least 5 years of research or industry experience in CPU performance analysis, workload optimization, or microarchitecture design.
  • Strong expertise in microarchitectural performance tuning, profiling, and analysis.
  • Proficiency in performance profiling tools such as perf, VTune, AMDuProf or proprietary CPU performance analysis tools.
  • Deep understanding of CPU microarchitecture concepts, including CPU pipelines, memory hierarchy, and branch prediction.
  • Experience with benchmarking and workload characterization methodologies.
  • Strong programming skills in C/C++, assembly language (with a focus on vector-length agnostic programming), and scripting languages such as Python and Shell.
  • Knowledge of operating system internals, compilers, and performance optimizations at the OS level.
  • Experience with GCC or LLVM compiler development is a plus.
  • Familiarity with cloud computing and HPC workloads is a plus.
  • Excellent problem-solving skills and ability to work across multidisciplinary teams.

Compensation for all engineers at Tenstorrent ranges from $100k - $500k including base and variable compensation targets. Experience, skills, education, background and location all impact the actual offer made.

Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.

Due to U.S. Export Control laws and regulations, Tenstorrent is required to ensure compliance with licensing regulations when transferring technology to nationals of certain countries that have had licensing conditions set by the U.S. government.

Our engineering positions and certain engineering support positions require access to information, systems, or technologies that are subject to U.S. Export Control laws and regulations. Please note that citizenship/permanent residency, asylee and refugee information and/or documentation will be required and considered as Tenstorrent moves through the employment process.

If a U.S. export license is required, employment will not begin until a license with acceptable conditions is granted by the U.S. government. If a U.S. export license with acceptable conditions is not granted by the U.S. government, then the offer of employment will be rescinded.

Life at Tenstorrent

At Tenstorrent, we are creating the next generation of high-performance processor ASICs, specifically engineered for deep learning and smart hardware. Our processor is designed to excel at both learning and inference, while being software-programmable to support future innovations in the field of machine learning. The processor's architecture easily scales from battery-powered IoT devices to large cloud servers, and surpasses today's solutions by several orders of magnitude in raw performance and energy efficiency. Our team, made up of alumni from hardware industry leaders like NVIDIA and AMD, is committed to providing the core hardware necessary to increase the pace of deep learning research and enable smart devices to live untethered from the power grid and the Internet. We are based in Toronto and proudly backed by Real Ventures, the Canadian VC of the Year two years running.

Thrive Here & What We Value

  • Innovation, collaboration, problem-solving
  • Competitive compensation package
  • Diverse team with varying seniorities
  • Hybrid work arrangement (Santa Clara, CA; Austin, TX)
  • Equal opportunity employer
  • Cutting-edge AI technology leadership
  • Passionate technologists in diverse teams
  • High performance RISC-V CPU development
