Bioinformatics

45 skills with this tag

K-Dense-AI
Passed
Gtars
High-performance toolkit for genomic interval analysis in Rust with Python bindings. Use when working with genomic regions, BED files, coverage tracks, overlap detection, tokenization for ML models, or fragment analysis in computational genomics and machine learning applications.
GenomicsBioinformaticsRust+3
802.5k
K-Dense-AI
Passed
Gget
CLI/Python toolkit for rapid bioinformatics queries. Preferred for quick BLAST searches. Access to 20+ databases: gene info (Ensembl/UniProt), AlphaFold, ARCHS4, Enrichr, OpenTargets, COSMIC, genome downloads. For advanced BLAST/batch processing, use biopython. For multi-database integration, use bioservices.
BioinformaticsGenomicsBlast+3
602.5k
K-Dense-AI
Passed
Geniml
This skill should be used when working with genomic interval data (BED files) for machine learning tasks. Use for training region embeddings (Region2Vec, BEDspace), single-cell ATAC-seq analysis (scEmbed), building consensus peaks (universes), or any ML-based analysis of genomic regions. Applies to BED file collections, scATAC-seq data, chromatin accessibility datasets, and region-based genomic feature learning.
GenomicsMachine LearningBioinformatics+3
602.5k
K-Dense-AI
Passed
Flowio
Parse FCS (Flow Cytometry Standard) files v2.0-3.1. Extract events as NumPy arrays, read metadata/channels, convert to CSV/DataFrame, for flow cytometry data preprocessing.
Flow CytometryFcs FilesScientific Computing+3
602.5k
K-Dense-AI
Passed
Etetoolkit
Phylogenetic tree toolkit (ETE). Tree manipulation (Newick/NHX), evolutionary event detection, orthology/paralogy, NCBI taxonomy, visualization (PDF/SVG), for phylogenomics.
PhylogeneticsBioinformaticsTree Visualization+3
402.5k
K-Dense-AI
Passed
Esm
Comprehensive toolkit for protein language models including ESM3 (generative multimodal protein design across sequence, structure, and function) and ESM C (efficient protein embeddings and representations). Use this skill when working with protein sequences, structures, or function prediction; designing novel proteins; generating protein embeddings; performing inverse folding; or conducting protein engineering tasks. Supports both local model usage and cloud-based Forge API for scalable inference.
Protein EngineeringBioinformaticsMachine Learning+3
602.5k
K-Dense-AI
Passed
Diffdock
Diffusion-based molecular docking. Predict protein-ligand binding poses from PDB/SMILES, confidence scores, virtual screening, for structure-based drug design. Not for affinity prediction.
Molecular DockingDrug DiscoveryProtein Ligand+3
702.5k
K-Dense-AI
Passed
Deeptools
A bioinformatics skill for next-generation sequencing (NGS) data analysis using deepTools. It helps users convert BAM files to coverage tracks, perform quality control assessments, create heatmaps and profile plots, and analyze ChIP-seq, RNA-seq, and ATAC-seq data with proper normalization methods.
BioinformaticsNgs AnalysisChip Seq+3
602.5k
K-Dense-AI
Passed
Cobrapy
Constraint-based metabolic modeling (COBRA). FBA, FVA, gene knockouts, flux sampling, SBML models, for systems biology and metabolic engineering analysis.
Systems BiologyMetabolic ModelingBioinformatics+3
402.5k
K-Dense-AI
Passed
Cellxgene Census
Query CZ CELLxGENE Census (61M+ cells). Filter by cell type/tissue/disease, retrieve expression data, integrate with scanpy/PyTorch, for population-scale single-cell analysis.
BioinformaticsSingle CellGenomics+3
802.5k
K-Dense-AI
Passed
Bioservices
Primary Python tool for 40+ bioinformatics services. Preferred for multi-database workflows: UniProt, KEGG, ChEMBL, PubChem, Reactome, QuickGO. Unified API for queries, ID mapping, pathway analysis. For direct REST control, use individual database skills (uniprot-database, kegg-database).
BioinformaticsProtein AnalysisPathway Discovery+3
402.5k
K-Dense-AI
Passed
Biopython
Primary Python toolkit for molecular biology. Preferred for Python-based PubMed/NCBI queries (Bio.Entrez), sequence manipulation, file parsing (FASTA, GenBank, FASTQ, PDB), advanced BLAST workflows, structures, phylogenetics. For quick BLAST, use gget. For direct REST API, use pubmed-database.
BioinformaticsMolecular BiologySequence Analysis+3
602.5k
K-Dense-AI
Passed
Arboreto
Infer gene regulatory networks (GRNs) from gene expression data using scalable algorithms (GRNBoost2, GENIE3). Use when analyzing transcriptomics data (bulk RNA-seq, single-cell RNA-seq) to identify transcription factor-target gene relationships and regulatory interactions. Supports distributed computation for large-scale datasets.
BioinformaticsGene Regulatory NetworksTranscriptomics+3
602.5k
K-Dense-AI
Passed
Anndata
This skill should be used when working with annotated data matrices in Python, particularly for single-cell genomics analysis, managing experimental measurements with metadata, or handling large-scale biological datasets. Use when tasks involve AnnData objects, h5ad files, single-cell RNA-seq data, or integration with scanpy/scverse tools.
BioinformaticsSingle CellGenomics+3
802.5k
K-Dense-AI
Passed
Adaptyv
Cloud laboratory platform for automated protein testing and validation. Use when designing proteins and needing experimental validation including binding assays, expression testing, thermostability measurements, enzyme activity assays, or protein sequence optimization. Also use for submitting experiments via API, tracking experiment status, downloading results, optimizing protein sequences for better expression using computational tools (NetSolP, SoluProt, SolubleMPNN, ESM), or managing protein design workflows with wet-lab validation.
Protein EngineeringBioinformaticsLaboratory Automation+3
602.5k