Apply for this job now

Director, Storage Operations

Location
Santa Clara, California
Job Type
Permanent
Posted
15 Oct 2020
NVIDIA is looking for an outstanding individual to lead a global team focused on storage operations. You will anticipate and support the demand of developing the next leading edge GPU. This essential role will give you a rare opportunity to influence a wide range of projects and functionality which impact all our growing businesses. You will help drive our operations strategy and grow your team both in size and skill. We need you to demonstrate proven delivery of operational excellence, an understanding of high-performance storage innovation and reliability. You will demonstrate a need to work multi-functionally with other teams to accomplish complex goals. The ideal candidate will ensure 24/7 availability and support of critical services. The role of this candidate will require 100% support to engineering, compute farm and business infrastructure environments.What You'll Be DoingManage the operations of storage infrastructure and services consisting of a mixture of enterprise appliances, networks, and open source technologiesIdentify and propose SLOs for the distributed enterprise and HPC storage systemsDocument the general procedures and practices, perform technology evaluations, and coordinate and track system orders, installations and deployment.Install and maintain configurations and solutions, as well as monitor and audit system utilization, performance and problems.Improve incident management and identify/implement the corrective actions.Work with design teams to identify new technologies that can solve operational problems. Perform trials and create appropriate solutions for meeting business needs, while reducing the complexity and variations of storage technologies.Implement and maintain storage systems and solution improvements using the UNIX, WINDOWS and Cloud platforms for diverse infrastructure needs for our business-critical applications like SAP, PDP, compute farm environments used for EDA and Deep Learning.What We Need To SeeBS in Computer Science or equivalent work experience15+ years of relevant experience10+ years of operations leadership of large-scale storage infrastructure.Demonstrated understand of root case analysis and significantly improving operational reliability.Experience or understanding of EDA environmentsBackground with file systems NFS, NTFS along with the Enterprise Storage (NAS/SAN)/Data Protection/Backup TechnologiesExperience with Parallel distributed file systems (GPFS, Lustre), virtualization (vmware, Xen, HyperV), docker container and cloud technologies.Previous experience designing and supporting object storage solutions, such as IBM COS, S3 and SwiftStack.Familiarity with Job Schedulers such as IBM Platform LSF and Sun Grid Engine a plusSolid experience in backup and restore technologies (Veritas, Cohesity, Commvault, etc.)Strong collaborative and social skills, specifically a shown ability to effectively guide and influence within a dynamic matrix environmentAble to schedule, prioritize, accomplish R activities and communicate actions and results as neededSolid attention to detail and excellent written and verbal communication skills are required.You will have solid experience in scripting / programming in some administrative language (Shell, Perl, Python, Powershell)Accomplished projects related to the distributed compute, storage, software defined storage.Good Understanding of Infrastructure SecurityDemonstrated innovations in large scale storage operations.Ways To Stand Out From The CrowdMeticulous organizer with an ever positive, can-do attitudeDemonstrate use of out-of-box thinking for creative solutions to highly sticky problemsFun and enthusiastic teammate who enjoys a challenge and celebrates successWorking knowledge of configuration management - SaltExperience with at least one of the job schedulers such as LSF, SLURM, Mesos/Marathon, Kubernetes, Docker SwarmProven operational leadership experience with large scale data center - 10,000+ serversExperience with solving Linux and ESX storage related problemsSolid experience with KerberosNVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, ambitious and enjoy having fun, then what are you waiting for apply today!NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression , sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Apply for this job now

Details

  • Job Reference: 182909326-2
  • Date Posted: 15 October 2020
  • Recruiter: Nvidia
  • Location: Santa Clara, California
  • Salary: On Application
  • Sector: I.T. & Communications
  • Job Type: Permanent