Lustre File System Engineer
Location: Remote (Worldwide)
Employment Type: Full-Time
Company: The Lustre Collective
About The Lustre Collective
The Lustre Collective is dedicated to advancing the Lustre file system, the leading open-source, parallel distributed file system designed for large-scale cluster computing. With deep expertise in scalable storage solutions, our team supports cutting-edge AI and HPC environments, on-premises and in the cloud. We are committed to vendor-neutral development, drawing inspiration from communities like OpenSFS and EOFS to drive innovation in Lustre technology.
As a tight-knit company with a fun, collaborative culture, we offer the unique opportunity to work alongside some of the smartest and most qualified file system and kernel engineers in the world. Our team thrives on innovation, open-source passion, and a supportive environment where ideas flow freely and work-life balance is prioritized. If you're excited about contributing to groundbreaking storage solutions in a dynamic, global team, we'd love to hear from you!
Job Overview
We are seeking a talented Lustre File System Engineer to join our remote team. In this role, you will contribute to the ongoing development, optimization, and maintenance of the Lustre file system, helping to deliver high-performance, scalable storage solutions for AI, HPC, and enterprise applications. This is an opportunity to make a real impact on open-source technology while collaborating with industry-leading experts in a fun, inclusive culture.
Key Responsibilities
- Design, implement, and optimize features in the Lustre file system, focusing on performance, scalability, and reliability in distributed environments.
- Develop and debug kernel-level code to enhance file system functionality, IO paths, and integration with HPC and cloud infrastructures.
- Collaborate with the team on vendor-neutral innovations, including AI-driven optimizations and community-driven enhancements inspired by OpenSFS and EOFS.
- Troubleshoot complex issues in large-scale cluster deployments, including performance tuning, fault tolerance, and data integrity.
- Contribute to open-source Lustre repositories, documentation, and upstream integration efforts.
- Participate in code reviews, knowledge-sharing sessions, and team discussions to foster a collaborative, innovative atmosphere.
- Stay abreast of advancements in storage technologies, kernel development, and parallel file systems to drive continuous improvement.
Required Qualifications and Skills
- Strong proficiency in C programming, with experience in low-level systems development.
- Expertise in Linux kernel programming, including module development, debugging, and performance optimization.
- In-depth knowledge of file systems, IO paths, and storage architectures (e.g., distributed file systems, RAID, networking protocols like RDMA).
- Familiarity with Lustre or similar parallel file systems (e.g., GPFS, Ceph) is highly preferred; hands-on experience with Lustre deployment, configuration, or development is a plus.
- Experience with distributed systems, high-performance computing (HPC), or cloud storage environments.
- Proficiency in tools such as Git, gdb, perf, and other kernel debugging utilities.
- Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
- Excellent problem-solving skills, with a passion for tackling complex technical challenges.
- Strong communication and collaboration abilities, suited to a remote, global team environment.
Preferred Qualifications
- Contributions to open-source projects, particularly in kernel or file system development.
- Experience with performance profiling, benchmarking, and optimization in large-scale clusters.
- Knowledge of AI/ML workloads, containerization (e.g., Docker, Kubernetes), or cloud platforms (e.g., AWS, Azure, GCP).
- Background in networking, security, or data management in HPC contexts.
What We Offer
- A fully remote position with flexible hours to accommodate global time zones.
- Competitive salary and benefits package available
- The chance to work in a fun, tight-knit culture where creativity and humor are encouraged—think virtual team-building events, hackathons, and a supportive community of experts.
- Opportunities for growth, including conferences, training, and leadership in open-source initiatives.
- A commitment to diversity, equity, and inclusion, welcoming candidates from all backgrounds worldwide.
If you're a passionate engineer ready to advance the future of Lustre and collaborate with world-class talent, apply today!
Contact Us to Apply