tooluniverse-chemical-compound-retrieval

mims-harvard/tooluniverse · updated Apr 8, 2026

$npx skills add https://github.com/mims-harvard/tooluniverse --skill tooluniverse-chemical-compound-retrieval
0 commentsdiscussion
summary

Retrieve comprehensive chemical compound data with proper disambiguation and cross-database validation.

skill.md

Chemical Compound Information Retrieval

Retrieve comprehensive chemical compound data with proper disambiguation and cross-database validation.

LOOK UP DON'T GUESS: Never assume a CID, ChEMBL ID, or molecular property value. Always retrieve from PubChem/ChEMBL.

English-first: Always use English compound names in tool calls. Respond in user's language.

Domain Reasoning: Disambiguation

"Aspirin" = one compound. "Vitamin D" = multiple forms (D2/D3/active metabolite). For generic class names (steroids, vitamins, acids), present candidates and confirm before proceeding.


Workflow

Phase 0: Clarify (only if highly ambiguous -- skip for unambiguous names or specific IDs)
Phase 1: Disambiguate → resolve PubChem CID + ChEMBL ID
Phase 2: Retrieve data (silent)
Phase 3: Report compound profile

Phase 1: Disambiguation

# By name
result = tu.tools.PubChem_get_CID_by_compound_name(compound_name=name)
# By SMILES
result = tu.tools.PubChem_get_CID_by_SMILES(smiles=smiles)
# Cross-reference
chembl_result = tu.tools.ChEMBL_search_compounds(query=name, limit=5)

Verify: CID + ChEMBL ID + canonical SMILES + stereochemistry + salt forms.

Phase 2: Data Retrieval

PubChem: PubChem_get_compound_properties_by_CID, PubChemBioAssay_get_assay_summary, PubChemTox_get_acute_effects, PubChem_get_compound_2D_image_by_CID

ChEMBL: ChEMBL_get_bioactivity_by_chemblid, ChEMBL_get_target_by_chemblid, ChEMBL_get_assays_by_chemblid

Optional: PubChem_get_associated_patents_by_CID, PubChem_search_compounds_by_similarity

Phase 3: Report

Compound Profile with: Identity (CID, ChEMBL ID, IUPAC, SMILES), Chemical Properties (MW, LogP, HBD, HBA, PSA, Lipinski), Bioactivity (targets, IC50/Ki), Drug Info (if approved), Data Sources.


Fallback Chains

Primary Fallback
PubChem name lookup ChEMBL search → SMILES → PubChem_get_CID_by_SMILES
ChEMBL bioactivity PubChem bioassay summary
Drug label Note "unavailable"

Evidence Grading

Grade Criteria
Confirmed CID + ChEMBL cross-match, InChI/SMILES agree
Probable CID found, partial ChEMBL match
Uncertain Single database only, or multiple CIDs
Unverified No cross-reference, single-source

Bioactivity: ChEMBL > PubChem BioAssay for curated data. IC50/Ki < 100nM = potent, 100nM-1uM = moderate, >10uM = weak. Lipinski violations reduce oral bioavailability but don't disqualify.


SMILES Verification

Always verify novel SMILES: python3 src/tooluniverse/tools/smiles_verifier.py --smiles "SMILES_STRING". Invalid SMILES produce wrong results or cryptic errors.


Tool Reference

PubChem: PubChem_get_CID_by_compound_name, PubChem_get_CID_by_SMILES, PubChem_get_compound_properties_by_CID, PubChem_get_compound_2D_image_by_CID, PubChemBioAssay_get_assay_summary, PubChemTox_get_acute_effects, PubChem_get_associated_patents_by_CID, PubChem_search_compounds_by_similarity, PubChem_search_compounds_by_substructure

ChEMBL: ChEMBL_search_drugs, ChEMBL_get_molecule, ChEMBL_get_activity, ChEMBL_get_target, ChEMBL_search_targets, ChEMBL_search_assays

Discussion

Product Hunt–style comments (not star reviews)
  • No comments yet — start the thread.
general reviews

Ratings

4.626 reviews
  • Dev Huang· Dec 20, 2024

    We added tooluniverse-chemical-compound-retrieval from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.

  • Shikha Mishra· Dec 8, 2024

    tooluniverse-chemical-compound-retrieval is among the better-maintained entries we tried; worth keeping pinned for repeat workflows.

  • Yash Thakker· Nov 27, 2024

    Keeps context tight: tooluniverse-chemical-compound-retrieval is the kind of skill you can hand to a new teammate without a long onboarding doc.

  • Ama Lopez· Nov 23, 2024

    I recommend tooluniverse-chemical-compound-retrieval for anyone iterating fast on agent tooling; clear intent and a small, reviewable surface area.

  • Advait Robinson· Nov 11, 2024

    Useful defaults in tooluniverse-chemical-compound-retrieval — fewer surprises than typical one-off scripts, and it plays nicely with `npx skills` flows.

  • Dhruvi Jain· Oct 18, 2024

    Registry listing for tooluniverse-chemical-compound-retrieval matched our evaluation — installs cleanly and behaves as described in the markdown.

  • Isabella Brown· Oct 14, 2024

    tooluniverse-chemical-compound-retrieval reduced setup friction for our internal harness; good balance of opinion and flexibility.

  • Advait Sethi· Oct 2, 2024

    tooluniverse-chemical-compound-retrieval has been reliable in day-to-day use. Documentation quality is above average for community skills.

  • Oshnikdeep· Sep 25, 2024

    tooluniverse-chemical-compound-retrieval reduced setup friction for our internal harness; good balance of opinion and flexibility.

  • Lucas Li· Sep 1, 2024

    We added tooluniverse-chemical-compound-retrieval from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.

showing 1-10 of 26

1 / 3