You are viewing a preview of this job. Log in or register to view more details about this job.

CPU Workload Performance Analysis Engineer

We are looking for a highly-skilled CPU Workload Performance Analysis Engineer to drive workload characterization, performance simulation, and in-depth performance analysis for cutting-edge CPU products. In this role, you will collaborate closely with CPU architects, RTL designers, and software engineers to reduce real-world applications for performance modeling, analyze workload characteristics, identify and resolve performance bottlenecks, and optimize performance-per-watt efficiency. Your work will directly influence the design and implementation of next-generation high-performance computing platforms across diverse workloads.

This role is Hybrid, based out of Santa Clara, CA or Austin,TX.

We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.

 

Responsibilities:

  • Conduct competitive analysis of the latest CPU products using industry-standard benchmarks and emerging applications.
  • Characterize CPU workloads, identifying performance bottlenecks and power inefficiencies in hardware and software interactions.
  • Collaborate with CPU architects and RTL designers to enhance microarchitectural features and improve performance/watt metrics.
  • Reduce workloads for CPU performance modeling, FPGA emulation, and model-to-RTL correlation.
  • Utilize performance models, EDA frameworks, and profiling tools to measure, characterize, and predict CPU performance and power under various workloads.
  • Stay up to date with industry trends, new workload requirements, and advancements in CPU microarchitecture and performance analysis techniques.

 

 

Experience & Qualifications:

  • Ph.D. in Computer Engineering, Electrical Engineering, or a related field.
  • Strong research background or industry expertise in benchmark construction, workload characterization, workload reduction, and performance simulation.
  • Proficiency in performance profiling tools such as Linux Perf, Strace, AMD’s uProf, Arm’s Telemetry Solution, or similar tools.
  • Deep understanding of CPU microarchitecture concepts, including superscalar pipelines, speculative execution, SIMD, and memory hierarchy.
  • Strong knowledge of operating system internals, runtimes, compilers, and GNU libraries.
  • Proficiency in C/C++, intrinsic/assembly programming, and scripting languages such as Python and Shell.
  • Excellent problem-solving and communication skills and the ability to work across multidisciplinary teams.
  • Experience with Ansible automation and GitLab CI is a plus.
  • Experience with GCC or LLVM compiler optimization is a plus
  • Familiarity with HPC and cloud virtualization is a plus.