Healthcare Prompt and Response Data

Healthcare
English
Italian
French
Portuguese (Portugal)
Spanish

About this dataset

This LLM dataset can serve as a catalyst for AI teams building models for healthcare. This meticulously curated collection comprises hundreds of thousands of real-world physician prompts and their corresponding machine-generated responses.

Through collaboration with general practitioners (primary care doctors) across four markets, we have cultivated a diverse and representative dataset. The dataset includes both clinical and non-clinical conversations, segmented by various vertices such as country, region, specialty, gender, and age group.

The dataset does not contain any personally identifiable information or patient data.

Q&A pairs are available in English (US), Italian, French (France), Portuguese (Portugal) and Spanish (Spain) languages.

This dataset is covered by our standard Data License Agreement. The license agreement is perpetual and allows for the commercialization of all models built on the data.

Samples

Preview

Healthcare_QA_LLM_Sample_3.png

Download Sample

Fill in the form, and get access to a free sample of this dataset

All fields are required

By downloading, installing, accessing, and/or using this data sample, you consent to receive communications from Defined.ai and affirm your acceptance of our Privacy Policy, Terms of Use, and Data License Agreement. Consent can be revoked at your discretion.

You might also be interested in:

STEM Q&A Pairs

STEM Question-Answer Dataset of 150,000 units coming soon
English
Chemistry
Mathematics
+4

© 2025 DefinedCrowd. All rights reserved.