### About this dataset

This LLM dataset can serve as a catalyst for AI teams building models for healthcare. This meticulously curated collection comprises hundreds of thousands of real-world physician prompts and their corresponding machine-generated responses.

Through collaboration with general practitioners (primary care doctors) across four markets, we have cultivated a diverse and representative dataset. The dataset includes both clinical and non-clinical conversations, segmented by various vertices such as country, region, specialty, gender, and age group.

The dataset does not contain any personally identifiable information or patient data.

Q&A pairs are available in English (US), Italian, French (France), Portuguese (Portugal) and Spanish (Spain) languages.

This dataset is covered by our standard [Data License Agreement](https://www.defined.ai/data-license-agreement). The license agreement is perpetual and allows for the commercialization of all models built on the data.

### Samples
#### Preview
![Healthcare_QA_LLM_Sample_3.png](https://prdstrapimediastorage.blob.core.windows.net/prdstrapimediastorage/assets/Healthcare_QA_LLM_Sample_2_json_aff8116136.png)


Fill in the form, and get access to a free sample of this dataset

### Full Sample
Download full sample of LLM medical prompts [here](https://prdstrapimediastorage.blob.core.windows.net/prdstrapimediastorage/assets/LLM_Sample_Medical_Prompts_e4c029865c.xlsx).

Healthcare Prompt and Response Data

Download Sample

You might also be interested in:

STEM Q&A Pairs

Named Entity Recognition

English Spontaneous Dialogue

Aspect-Based Sentiment Analysis