ImmuneID specializes in precision immunology, using software to analyze vast amounts of genomic data. Their original setup had limitations: data was stored on-premises and was not easily accessible for concurrent use by their team. Cloud303 transformed ImmuneID’s data management by implementing a nightly backup to Amazon S3 and integrating AppStream 2.0 with R Studio for seamless data processing. This not only cut costs but also empowered scientists with real-time data access.
ImmuneID was wrestling with data bottlenecks due to its reliance on on-premises storage solutions. Cloud303 - an AWS Premier Consulting Partner - stepped in to provide a cost-effective, scalable solution that would enable simultaneous, efficient data access for data analytics using RStudio.
Cloud303's engagements follow a streamlined five-phase lifecycle: Requirements, Design, Implementation, Testing, and Maintenance. Initially, a comprehensive assessment is conducted through a Well-Architected Review to identify client needs. This is followed by a scoping call to fine-tune the architectural design, upon which a Statement of Work (SoW) is agreed and signed.
The implementation phase kicks in next, closely adhering to the approved designs. Rigorous testing ensures that all components meet the client's specifications and industry standards. Finally, clients have the option to either manage the deployed solutions themselves or to enroll in Cloud303's Managed Services for ongoing maintenance, an option many choose due to their high satisfaction with the services provided.
Upon an initial on-site assessment, Cloud303 connected ImmuneID's gene sequencer directly to Amazon S3, implementing a nightly automated backup system. To facilitate data analysis, we engineered an AppStream 2.0 golden image pre-loaded with RStudio.
However, the key innovation was in merging the capabilities of Amazon FSx and Amazon S3 within the AppStream 2.0 environment. This was accomplished using a combination of batch and PowerShell scripts along with a third-party application called RClone. This setup made it possible for scientists to transition data seamlessly between the high-throughput FSx and the cost-effective S3 storage solutions.
In technical terms, the configuration enabled a Windows file system mount of the S3 bucket, allowing easy data transfer to FSx whenever high-speed data manipulation was needed. We also rolled out a memory-optimized AppStream 2.0 fleet to ensure peak performance, especially when running complex R scripts.
Designing the solution for ImmuneID was akin to solving a multidimensional puzzle. Using a blend of AWS services, we achieved a solution that was both cost-effective and highly efficient.
ImmuneID’s new data ecosystem, powered by AWS and architected by Cloud303, has revolutionized their genomic data analytics. Scientists can now access and analyze data concurrently, thereby speeding up research timelines. This solution's elasticity allows ImmuneID to quickly scale their operations in line with business growth. Moreover, the cost savings have been substantial, enabling the re-allocation of financial resources to other critical research areas.