QualificationsNetwork protocolsSystem architectureSoCSystem designCommunication skillsDoctoral degreeBachelor’s degreeMaster’s degreeDoctor of Philosophy
AI/HPC Systems Engineer Responsibilities:
Define Meta Infrastructure AI platform, system and cluster SW and HW and networking architect.
Generate system and platform SW & HW requirement documents based on use cases needs, system performance and power analysis, operation requirements, cross functional team input, and available technologies.
Expertise with computer technologies and accelerators (e.g. GPU).
Lead multi-disciplinary teams to identify SW & HW solutions and assess trade-offs for feasible implementation.
Build proto-type for proof of concept, to quickly explore a variety of options with good understanding of both technical and business impact, and demonstrate effective collaborations with cross functional partners and stakeholders.
Collaborate with and influence external partners and open-source communities to accelerate innovations, and create options for Meta infrastructure. Provide technical design guidance for AI SW & HW platform, SoC and component design activities.
Influences the shaping of future product solutions by significantly contributing to the architecture and technology. Provides multilayered technical expertise for next generation initiatives.
Minimum Qualifications:
Bachelor’s degree or higher in Computer Engineering, Computer Science, Electrical Engineering, or similar field.
15+ years of diverse computing platform and system architecture, system-on-chip architecture, and implementation experience.
Skills to abstract, breakdown, analysis, and drive solutions to complex problems. Communication skills, and documentation skills.
Skills in adapting and learning based on collaboration, team growth, business, and technology needs.
Preferred Qualifications:
MS or PhD in computer engineering or electrical engineering or related field.
Experiences with AI platform, network, and system architecture.
Experience with HPC technologies and design.
Experience with SW stacks including Firmware and Compilers for HPC.
Linux OS and software knowledge of compute systems.
Experiences with ASIC design and development.
Experience with development of BMC and system Firmware.
Experience with development of networking protocols such as RoCE, InfiniBand etc.
Experience with development of memory and/or I/O buses (DDR, HBM, PCIe, CXL, NVLINK, or XGMI etc.).
Experience with development of storage buses (NVMe, NVMeoF, SATA, SAS).
Experience with engaging with application customers, and hands on to run benchmarks on compute systems.
Experience with performance and power trade-off, optimization of HPC/AI systems.
Facebook is proud to be an Equal Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law.Facebook is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at [email protected]