Introduction

Intro and motivation for creating SeQuiLa.

The SeQuiLa project was started at the Institute of Computer Science at Warsaw University of Technology in late 2017. The main goal of the project was to extend Apache Spark with fast and scalable implementations of common bioinformatics operations needed for processing large next-generation sequencing (NGS) datasets, such as interval joins, depth of coverage, and pileup.

If you are new to Apache Spark and distributed processing, we highly recommend getting started with the excellent Apache Spark documentation.

Keep in mind that SeQuiLa is just an extension to Apache Spark. The combination of the two gives you almost unlimited analytical power to crunch NGS data at any scale!

Last modified February 1, 2023: fix:race condition for local[n] in toCoverage method (#172) (a49c32b)