KLA Careers

AI Engineer: Device Parallelism Researcher

Company Overview

Calling the adventurers ready to join a company that's pushing the limits of nanotechnology to keep the digital revolution rolling. At KLA, we're making technology advancements that are bigger—and tinier—than the world has ever seen.

Who are we?  We research, develop, and manufacture the world's most advanced inspection and measurement equipment for the semiconductor and nanoelectronics industries. We enable the digital age by pushing the boundaries of technology, creating tools capable of finding defects smaller than a wavelength of visible light. We create smarter processes so that technology leaders can manufacture high-performance chips—the kind in that phone in your pocket, the tablet on your desk and nearly every electronic device you own—faster and better. We're passionate about creating solutions that drive progress and help people do what wouldn't be possible without us.  The future is calling. Will you answer?

Group/Division

With over 40 years of semiconductor process control experience, chipmakers around the globe rely on KLA to ensure that their fabs ramp next-generation devices to volume production quickly and cost-effectively. KLA's Global Products Group (GPG), which is responsible for creating all of KLA’s metrology and inspection products, is looking for forward-thinking research scientists, software engineers, application development engineers, and senior product technology process engineers to join our team and enable the movement towards advanced chip design. 

About KLA Advanced Computing Labs, India:
KLA advanced computing Labs’ (ACL) mission in India is to deliver advanced parallel computing research and software architectures for AI + HPC + Cloud solutions to accelerate KLA’s product performance. This team explores high-risk approaches, pioneering technologies, and novel methods to accelerate KLA’s algorithms and contribute to KLA’s HPC technology roadmap. We engage leading thinkers in academia, industry and KLA’s business units to create innovative parallel computing methods to enable KLA’s business growth.
 

Responsibilities

Position: (Ph.D/MS NCG position: CUDA device level parallelism)
 
KLA is hiring engineers for its Advanced Computing Labs in Chennai, India. KLA ACL is at our new research center in the IITM, Research Park. The goal of the center is to conduct computational research in parallel and distributed sub-systems and deploy them to KLA’s advanced semi-conductor platforms that are used for inspection and metrology tasks in leading fabs. These efforts are part of a larger global initiative at KLA to scale up its AI + HPC + cloud infrastructure.
 
What will you be responsible for?
 
As part of this elite R&D team, the job entails understanding core algorithms that have to expressed in various parallel computing constructs particularly HPC accelerators such as GPUs.  The first step in optimizing will be to theoretically model break-down of our AI algorithms and model it in terms of available bandwidth, computational FLOPS etc. The implementation steps will include CUDA level programming along with performance tuning to ensure that we can come close to achieving the theoretical model. The developer will be exposed to a variety of image processing, signal processing and deep learning loads that have to be optimized. A complimentary stage of optimization includes exploring existing libraries and programming in higher level constructs such C++ Parallel programming.
 
While the initial focus of the team will be on NVIDIA GPUs, the R&D team will also be looking at other GPU accelerators from other vendors as well as FPGA acceleration. You will collaborate with peer researchers in parallel computing areas and with algorithm teams in product groups. 
 

Qualifications

What we would like to see?
New/recent College graduates in Ph.D. (preferred), Dual Degree MS in EE, CS or CSE. Bachelors graduates will also be considered. 
A researcher who has a strong foundation in computer architecture, and in particular with a focus on high performance parallel processing at the device level (GPUs or CPUs/SIMD or FPGAs).
The researcher should have a strong mental model of computational loads and mapping different algorithms to parallel architectures.
Proficient in programming skills in C/Modern C++ and Python.
Experience in analyzing and tuning applications using profiling tools such as NSIGHT or VTUNES.
Good understanding and exposure to the Linux operating system at the user level.
Exposure to multiprocessor and multithreading concepts
Some familiarity with GPU programming such as CUDA, OpenCL or SYCL.
 
 
The position also requires a person with significant communication, initiative and the ability to navigate from relatively high-level requirements to low level computational models.
 
 
 
Bonus skills:
 
Any prior experience in KLA domains such as wafer inspection coupled with programming in CUDA or AVX will be a very plus. Additionally, any experience in optimizing large scale signal or computer vision algorithms would also be a major plus. It
 
Experience in FPGA programming while not essential will also be a major plus.
 
Experience in large scale distributed HPC systems, proven experience in Docker and Container orchestration and any expertise in AI Frameworks (Tensorflow) will also be welcome.
 
Finally a strong background in Modern C++ concepts (C++ 11 through C++ 17), STL library would also be a way to stand out from the crowd.