
Align every FASTQ sample
Dataset: paired-end FASTQ manifest
import os
import subprocess
import time
from pathlib import Path
from burla import remote_parallel_map
IMAGE = "us-docker.pkg.dev/test-burla/burla-demos/burla-bio-worker:latest"
REF_FASTA = "/refs/hg38.fa"
S3_OUT = "s3://my-bam-bucket"Step 1: Use an image with the native tools
Step 2: Align one sample per worker
Step 3: Smoke test one sample
Step 4: Run the cohort
What's the point?
Last updated