Bioinformatics

45 skills with this tag

K-Dense-AI
Passed
Latchbio Integration
This skill provides comprehensive documentation for integrating with Latch, a Python framework for bioinformatics workflows. It covers workflow creation with @workflow/@task decorators, cloud data management with LatchFile/LatchDir, resource configuration for CPU/GPU tasks, and access to verified workflows like AlphaFold and DESeq2.
BioinformaticsScientific ComputingWorkflow Automation+3
1102.5k
K-Dense-AI
Passed
Dnanexus Integration
DNAnexus cloud genomics platform. Build apps/applets, manage data (upload/download), dxpy Python SDK, run workflows, FASTQ/BAM/VCF, for genomics pipeline development and execution.
DnanexusGenomicsBioinformatics+3
702.5k
K-Dense-AI
Passed
Exploratory Data Analysis
Perform comprehensive exploratory data analysis on scientific data files across 200+ file formats. This skill should be used when analyzing any scientific data file to understand its structure, content, quality, and characteristics. Automatically detects file type and generates detailed markdown reports with format-specific analysis, quality metrics, and downstream analysis recommendations. Covers chemistry, bioinformatics, microscopy, spectroscopy, proteomics, metabolomics, and general scientific data formats.
Data AnalysisScientific ComputingBioinformatics+3
802.5k
K-Dense-AI
Passed
Uniprot Database
Direct REST API access to UniProt. Protein searches, FASTA retrieval, ID mapping, Swiss-Prot/TrEMBL. For Python workflows with multiple databases, prefer bioservices (unified interface to 40+ services). Use this for direct HTTP/REST work or UniProt-specific control.
BioinformaticsProtein DatabaseApi Client+3
502.5k
K-Dense-AI
Passed
String Database
Query STRING API for protein-protein interactions (59M proteins, 20B interactions). Network analysis, GO/KEGG enrichment, interaction discovery, 5000+ species, for systems biology.
BioinformaticsProtein InteractionsSystems Biology+3
502.5k
K-Dense-AI
Passed
Reactome Database
This skill enables querying the Reactome biological pathway database to perform pathway enrichment analysis, map genes to pathways, and explore molecular interactions. It provides a Python helper script and REST API documentation for systems biology research.
BioinformaticsPathway AnalysisSystems Biology+3
502.5k
K-Dense-AI
Passed
Pdb Database
This skill provides comprehensive guidance for accessing the RCSB Protein Data Bank programmatically. It enables searching for protein and nucleic acid 3D structures by text, sequence, or structural similarity, downloading coordinate files in various formats, and retrieving structural metadata for applications in drug discovery and structural biology.
BioinformaticsProtein StructureStructural Biology+3
502.5k
K-Dense-AI
Passed
Opentargets Database
Query Open Targets Platform for target-disease associations, drug target discovery, tractability/safety data, genetics/omics evidence, known drugs, for therapeutic target identification.
Drug DiscoveryBioinformaticsTarget Identification+3
702.5k
K-Dense-AI
Passed
Metabolomics Workbench Database
This skill provides comprehensive guidance for using the NIH Metabolomics Workbench REST API, a repository of over 4,200 metabolomics studies. It enables querying metabolite structures by various identifiers, accessing study metadata and experimental results, standardizing nomenclature with RefMet, performing mass spectrometry m/z searches, and retrieving gene/protein-metabolite associations.
MetabolomicsBioinformaticsMass Spectrometry+3
702.5k
K-Dense-AI
Passed
Kegg Database
Direct REST API access to KEGG (academic use only). Pathway analysis, gene-pathway mapping, metabolic pathways, drug interactions, ID conversion. For Python workflows with multiple databases, prefer bioservices. Use this for direct HTTP/REST work or KEGG-specific control.
BioinformaticsPathway AnalysisGenomics+3
702.5k
K-Dense-AI
Passed
Hmdb Database
Access Human Metabolome Database (220K+ metabolites). Search by name/ID/structure, retrieve chemical properties, biomarker data, NMR/MS spectra, pathways, for metabolomics and identification.
MetabolomicsBioinformaticsBiomarker Discovery+3
502.5k
K-Dense-AI
Passed
Gwas Database
Query NHGRI-EBI GWAS Catalog for SNP-trait associations. Search variants by rs ID, disease/trait, gene, retrieve p-values and summary statistics, for genetic epidemiology and polygenic risk scores.
GwasGeneticsGenomics+3
602.5k
K-Dense-AI
Passed
Geo Database
Access NCBI GEO for gene expression/genomics data. Search/download microarray and RNA-seq datasets (GSE, GSM, GPL), retrieve SOFT/Matrix files, for transcriptomics and expression analysis.
Gene ExpressionTranscriptomicsBioinformatics+3
602.5k
K-Dense-AI
Passed
Gene Database
Query NCBI Gene via E-utilities/Datasets API. Search by symbol/ID, retrieve gene info (RefSeqs, GO, locations, phenotypes), batch lookups, for gene annotation and functional analysis.
BioinformaticsGenomicsNcbi+3
602.5k
K-Dense-AI
Passed
Ensembl Database
Query Ensembl genome database REST API for 250+ species. Gene lookups, sequence retrieval, variant analysis, comparative genomics, orthologs, VEP predictions, for genomic research.
GenomicsBioinformaticsEnsembl+3
802.5k
K-Dense-AI
Passed
Ena Database
Access European Nucleotide Archive via API/FTP. Retrieve DNA/RNA sequences, raw reads (FASTQ), genome assemblies by accession, for genomics and bioinformatics pipelines. Supports multiple formats.
GenomicsBioinformaticsNucleotide Sequences+3
802.5k
K-Dense-AI
Passed
Drugbank Database
This skill enables access to the DrugBank pharmaceutical database containing ~9,591 drug entries with comprehensive drug information including chemical properties, drug-drug interactions, protein targets, metabolic pathways, and ADMET predictions. It supports drug discovery research, polypharmacy safety analysis, drug repurposing studies, and pharmacology research using Python with RDKit for chemical analysis.
DrugbankPharmaceuticalDrug Interactions+3
702.5k
K-Dense-AI
Passed
Cosmic Database
Access COSMIC cancer mutation database. Query somatic mutations, Cancer Gene Census, mutational signatures, gene fusions, for cancer research and precision oncology. Requires authentication.
BioinformaticsCancer ResearchGenomics+3
802.5k
K-Dense-AI
Passed
Clinvar Database
This skill provides comprehensive guidance for accessing NCBI's ClinVar database, which contains clinical interpretations of human genetic variants. It covers searching via the E-utilities API, interpreting clinical significance classifications (pathogenic, benign, VUS), downloading bulk data from FTP, and processing variants in XML, VCF, and tab-delimited formats for genomic medicine applications.
GenomicsClinical GeneticsBioinformatics+3
702.5k
K-Dense-AI
Passed
Clinicaltrials Database
Query ClinicalTrials.gov via API v2. Search trials by condition, drug, location, status, or phase. Retrieve trial details by NCT ID, export data, for clinical research and patient matching.
Clinical TrialsHealthcareMedical Research+3
602.5k
K-Dense-AI
Passed
Chembl Database
Query ChEMBL's bioactive molecules and drug discovery data. Search compounds by structure/properties, retrieve bioactivity data (IC50, Ki), find inhibitors, perform SAR studies, for medicinal chemistry.
ChemblDrug DiscoveryBioinformatics+3
702.5k
K-Dense-AI
Passed
Brenda Database
Access BRENDA enzyme database via SOAP API. Retrieve kinetic parameters (Km, kcat), reaction equations, organism data, and substrate-specific enzyme information for biochemical research and metabolic pathway analysis.
EnzymologyBiochemistryKinetics+3
702.5k
K-Dense-AI
Passed
Alphafold Database
Access AlphaFold's 200M+ AI-predicted protein structures. Retrieve structures by UniProt ID, download PDB/mmCIF files, analyze confidence metrics (pLDDT, PAE), for drug discovery and structural biology.
ResearchProtein StructureBioinformatics+3
802.5k
K-Dense-AI
Passed
Scvi Tools
This skill provides detailed documentation and best practices for analyzing single-cell omics data using scvi-tools. It covers probabilistic modeling for scRNA-seq, scATAC-seq, CITE-seq, spatial transcriptomics, and other single-cell modalities including batch correction, differential expression, and cell type annotation.
BioinformaticsSingle CellGenomics+3
702.5k
K-Dense-AI
Passed
Scikit Bio
Biological data toolkit. Sequence analysis, alignments, phylogenetic trees, diversity metrics (alpha/beta, UniFrac), ordination (PCoA), PERMANOVA, FASTA/Newick I/O, for microbiome analysis.
BioinformaticsPythonMicrobiome+3
602.5k
K-Dense-AI
Passed
Scanpy
This skill enables complete single-cell RNA-seq analysis using the scanpy Python toolkit. It guides users through quality control, normalization, dimensionality reduction (PCA/UMAP/t-SNE), Leiden clustering, marker gene identification, and cell type annotation, with included scripts and templates for automated workflows.
BioinformaticsSingle CellRna Seq+3
502.5k
K-Dense-AI
Passed
Pysam
Pysam is a comprehensive reference skill for bioinformatics developers working with genomic data. It provides documentation and code examples for reading, manipulating, and writing sequencing alignment files (SAM/BAM/CRAM), genetic variant files (VCF/BCF), and sequence files (FASTA/FASTQ) using the pysam Python library.
BioinformaticsGenomicsPython+3
602.5k
K-Dense-AI
Passed
Pyopenms
Python interface to OpenMS for mass spectrometry data analysis. Use for LC-MS/MS proteomics and metabolomics workflows including file handling (mzML, mzXML, mzTab, FASTA, pepXML, protXML, mzIdentML), signal processing, feature detection, peptide identification, and quantitative analysis. Apply when working with mass spectrometry data, analyzing proteomics experiments, or processing metabolomics datasets.
BioinformaticsMass SpectrometryProteomics+3
1002.5k
K-Dense-AI
Passed
Pydeseq2
Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq analysis.
Rna SeqBioinformaticsDifferential Expression+3
502.5k
K-Dense-AI
Passed
Lamindb
This skill should be used when working with LaminDB, an open-source data framework for biology that makes data queryable, traceable, reproducible, and FAIR. Use when managing biological datasets (scRNA-seq, spatial, flow cytometry, etc.), tracking computational workflows, curating and validating data with biological ontologies, building data lakehouses, or ensuring data lineage and reproducibility in biological research. Covers data management, annotation, ontologies (genes, cell types, diseases, tissues), schema validation, integrations with workflow managers (Nextflow, Snakemake) and MLOps platforms (W&B, MLflow), and deployment strategies.
BioinformaticsData ManagementOntologies+3
702.5k