|
| 1 | +--- |
| 2 | +title: "Accelom and SYCLOPS" |
| 3 | +date: 2024-07-17 |
| 4 | +layout: update |
| 5 | +tags: |
| 6 | + - accelom |
| 7 | + - syclops |
| 8 | +--- |
| 9 | + |
| 10 | +Due to significant improvements in genomic sequencing technology, it is now possible to sequence a full human genome for |
| 11 | +less than $1000 and receive genetic data on several molecular levels of the cell (genomic, transcriptome, protein |
| 12 | +expression, copy-number variations, and so on). However, in order to turn this genomic Big Data into useful insights for |
| 13 | +precision medicine, a systematic integrated study of a range of biological datasets is required. ACCELOM develops |
| 14 | +bespoke, reproducible multi-omics software pipelines that combine numerous molecular datasets from a host to investigate |
| 15 | +association patterns between molecular levels using unique, peer-reviewed statistical machine learning algorithms. |
| 16 | + |
| 17 | + |
| 18 | + |
| 19 | +ACCELOM pipelines rely on the well-established GATK pipeline for performing secondary analysis, which transforms raw |
| 20 | +sequenced reads into analysis-ready variants. It is well known that the GATK pipeline is extremely computationally |
| 21 | +intensive[[1]][1] especially for large datasets (ex. Whole Genome Sequencing). To address this bottleneck, several hardware |
| 22 | +acceleration solutions have emerged to scale secondary analysis[[2]][2] [[3]][3]. While these solutions often improve |
| 23 | +time-to-insight by several orders of magnitude, they are closed and proprietary on both the software front (as |
| 24 | +accelerated pipelines not freely available) and hardware front (as they are tied to a specific vendor hardware). |
| 25 | + |
| 26 | +Through SYCLOPS, ACCELOM intends to collaborate with various partners in the development and validation of a SYCL-based |
| 27 | +library (SYCL-GAL) that will accelerate key stages of the GATK secondary analysis pipeline. Unlike contemporary |
| 28 | +solutions, SYCL-GAL library will be open source in nature to foster the adoption of open standards in genomics. Further, |
| 29 | +by relying on SYCL for cross-architecture parallelism, SYCL-GAL will also enable cross-platform acceleration of genomic |
| 30 | +data analysis, making it possible to deploy the same pipeline on CPUs, GPUs, and accelerators from several hardware |
| 31 | +vendors. |
| 32 | + |
| 33 | +[1]: https://www.intel.com/content/dam/www/public/us/en/documents/white-papers/deploying-gatk-best-practices-paper.pdf |
| 34 | +[2]: https://www.nvidia.com/en-us/clara/genomics/ |
| 35 | +[3]: https://emea.illumina.com/products/by-type/informatics-products/dragen-secondary-analysis/order.html |
0 commit comments