Install
openclaw skills install bioinformaticsAnalyze DNA, RNA, and protein sequences with alignment, variant calling, and expression analysis pipelines.
openclaw skills install bioinformaticsOn first use, read setup.md for integration guidelines. Create ~/bioinformatics/ with user consent to store project context and preferences.
User needs to analyze biological sequences, run genomic pipelines, or interpret sequencing data. Agent handles sequence alignment, variant calling, expression analysis, and format conversions.
Memory lives in ~/bioinformatics/. See memory-template.md for structure.
~/bioinformatics/
├── memory.md # Projects, preferences, reference genomes
├── pipelines/ # Saved pipeline configurations
└── results/ # Analysis outputs and logs
| Topic | File |
|---|---|
| Setup process | setup.md |
| Memory template | memory-template.md |
| File formats | formats.md |
| Tool commands | tools.md |
| RNA-seq pipeline | rnaseq.md |
| Variant calling | variants.md |
Before any analysis, check input data quality:
samtools quickcheck)bcftools view -h)Bad input → garbage output. Always QC first.
Track which reference is used per project:
Store reference info in ~/bioinformatics/memory.md per project.
NEVER modify original FASTQ/BAM files:
Bioinformatics commands can consume massive resources:
samtools view | ...)Every analysis must be reproducible:
samtools --version)chr1 vs 1 causes silent failures. Check and convert with sed 's/^chr//'.bai, VCF needs .tbi. Commands fail cryptically without them--threads wisely| Format | Purpose | Key Tool |
|---|---|---|
| FASTA | Reference sequences | samtools faidx |
| FASTQ | Raw reads + quality | seqtk, fastp |
| SAM/BAM | Aligned reads | samtools |
| VCF/BCF | Variants | bcftools |
| BED | Genomic intervals | bedtools |
| GFF/GTF | Gene annotations | gffread |
| BigWig | Coverage tracks | deepTools |
# FASTQ quality report
fastqc sample.fastq.gz -o qc_reports/
# Trim adapters + low quality
fastp -i R1.fq.gz -I R2.fq.gz -o R1.clean.fq.gz -O R2.clean.fq.gz
# BAM statistics
samtools flagstat aligned.bam
samtools stats aligned.bam > stats.txt
# Index reference (once)
bwa index reference.fa
# Align paired-end reads
bwa mem -t 8 reference.fa R1.fq.gz R2.fq.gz | \
samtools sort -o aligned.bam -
# Index BAM
samtools index aligned.bam
# Call variants
bcftools mpileup -Ou -f reference.fa aligned.bam | \
bcftools call -mv -Oz -o variants.vcf.gz
# Index VCF
bcftools index variants.vcf.gz
# Filter variants
bcftools filter -s LowQual -e 'QUAL<20' variants.vcf.gz
# Extract region
samtools view -b aligned.bam chr1:1000000-2000000 > region.bam
# Convert BAM to FASTQ
samtools fastq -1 R1.fq.gz -2 R2.fq.gz aligned.bam
# Merge BAMs
samtools merge merged.bam sample1.bam sample2.bam
# Subset VCF by region
bcftools view -r chr1:1000-2000 variants.vcf.gz
Data access:
Data that stays local:
This skill does NOT:
Note: Installing tools (conda, brew) and downloading reference genomes requires internet access. These are user-initiated actions.
Install with clawhub install <slug> if user confirms:
data-analysis — statistical interpretationstatistics — hypothesis testingscience — research methodologyclawhub star bioinformaticsclawhub sync