Refer a friend: Referral fee program
Career Developers Inc., a distinguished staffing and consulting firm, is proud to celebrate 30 years of service excellence. As a GSA Contract holder, we offer comprehensive staffing solutions for both commercial and government sectors nationwide. By selectively partnering with clients who share our values, we ensure productive collaborations that set us apart in the industry. Our dedication to candidates involves managing expectations with precision through business intelligence, thorough interview preparation, transparent communication, and exceptional feedback throughout the process.
We are committed to advancing your career and look forward to supporting your professional growth.
Senior Storage Solutions Engineer (Parallel Computing / HPC Environment)
Location : Manhattan, NY (Hybrid) or could be FULLY REMOTE if living outside the area
Salary : Open and will commensurate on experience
Must have the following: Platform Experience - This role involves managing shared storage environments across the company including block, object, and NFS storage. Candidates must be multi-protocol experts, capable of architecting and implementing new storage solutions from scratch or integrating existing products. They require strong Python scripting skills for automation, monitoring, and performance benchmarking. Knowledge of various storage platforms like Weka, Fast, GPFS, and Vast is essential, along with deep Linux and networking understanding. The role demands escalation-level support expertise and the ability to operate independently.
Responsibilities:
- Design, build, and maintain large-scale HPC storage and compute environments to support business growth
- Optimize performance, scalability, and capacity across high-performance storage systems
- Automate deployment, configuration, and monitoring processes using Chef, Ansible, and Python
- Support hybrid storage architectures spanning on-premises and cloud (AWS, GCP) environments
- Collaborate with global engineering and infrastructure teams to enforce standards and enhance system reliability
- Troubleshoot complex performance and integration issues across hardware, OS, and distributed systems layers
- Implement observability solutions (Prometheus, Grafana, Datadog, ELK stack) to monitor and tune storage performance
- Develop and maintain infrastructure as code for storage and compute environments
Requirements:
- 5+ years in infrastructure engineering with a focus on building out distributed storage systems in a Linux environment.
- Proven hands-on experience with WEKA or similar parallel file systems (GPFS, Lustre, Ceph)
- Strong Python skills for automation and tool development
- Experience managing petabyte-scale storage systems
- Familiarity with containers, hypervisors, and public cloud infrastructure (AWS, GCP)
- Solid understanding of CI/CD, version control, and modern infrastructure practices
- Bachelor's degree in Computer Science, Engineering, or related field
- Strong problem-solving skills with the ability to work independently and in fast-paced environments
INDH
WEKA engineer, WEKA storage, WEKA file system, distributed storage engineer, storage solutions architect, Linux infrastructure engineer, high performance computing, HPC storage, GPFS, Ceph, Lustre, parallel file systems, data storage optimization, storage automation, Python scripting, storage performance tuning, cloud storage engineer, AWS storage, GCP storage, infrastructure automation, Ansible, Chef, Datadog, Prometheus, Grafana, storage scaling, capacity management, storage reliability, hybrid storage, distributed compute, HPC infrastructure