How to Use the Medical Text Analyzer

A comprehensive guide to analyzing medical documents with AI-powered entity recognition

Quick Start

Get started in 3 simple steps:

Input your text - Upload a file or paste medical text directly
Select a model - Choose which medical entities to detect
Analyze - Click the analyze button and view results instantly

Step 1: Input Your Medical Text

Option A: Upload a File

Supported formats:

PDF - Medical reports, lab results, discharge summaries
DOCX - Word documents, clinical notes
TXT - Plain text files

Maximum file size: 10MB

Enhanced Extraction

When to enable Enhanced Extraction:

Scanned documents or images
PDFs with poor text quality
Documents with tables or complex layouts
Handwritten clinical notes

Note: Enhanced extraction uses AI-powered OCR and may incur additional processing time.

Option B: Paste Text Directly

Simply paste your medical text into the text area. Perfect for:

Quick analysis of clipboard content
Testing with sample text
Short clinical notes or snippets

Step 2: Choose Your Analysis Model

Select the type of medical entities you want to detect. Each model is specialized for different medical terminology:

🦠

Disease Recognition Model

Core Model

Detects diseases, conditions, disorders, and syndromes

Examples: Hypertension, Type 2 Diabetes, Acute Myocardial Infarction

💊

Pharmaceutical Identifier

Core Model

Identifies medications, drugs, and pharmaceutical compounds

Examples: Metformin, Lisinopril, Aspirin 81mg

🫀

Medical Anatomy Detector

Core Model

Recognizes anatomical structures, organs, and body parts

Examples: Left ventricle, Anterior descending artery, Liver

🔬

Pathology Detector

Core Model

Identifies pathological findings and abnormalities

Examples: Stenosis, Hypertrophy, Inflammation

🧬

Genomic Entity Extractor

Core Model

Extracts genomic and genetic information

Examples: BRCA1, TP53 mutation, HLA-B27

🔒

PII Detection System

Core Model

Identifies personally identifiable information for privacy

Examples: Patient names, dates of birth, MRN numbers

💡 Pro Tip: Core models are pre-loaded for instant analysis!

Additional specialized models (Protein, Chemical, Oncology, Species) are available and load on-demand when selected.

Step 3: Adjust Confidence Threshold

The confidence threshold controls which detected entities are shown in your results.

0%50%100%

Lower Threshold (30-50%)

More entities detected, but may include false positives

Good for: Exploratory analysis, catching all possibilities

Higher Threshold (70-90%)

Fewer entities, but higher accuracy and precision

Good for: Production use, reliable results

Recommended: 65% (Default)

Balanced approach that filters out most false positives while retaining true entities.

Step 4: Understanding Your Results

Results are displayed in three different views for comprehensive analysis:

📊 Table View

Sortable and filterable table of all detected entities

✓Sort by confidence, type, or position
✓Filter by entity type
✓View confidence scores as progress bars
✓See exact text positions

🔗 Graph View

Interactive visual representation of entities

✓Color-coded nodes by entity type
✓Pan and zoom for detailed inspection
✓Quick overview of entity distribution
✓Interactive exploration

✨ Highlighted Text

Original text with entities highlighted in context

✓See entities in their original context
✓Color-coded highlights by type
✓Hover for confidence scores
✓Legend showing entity counts

Statistics Summary

At the top of results, you'll see key metrics:

Total Entities: Number of entities detected
Average Confidence: Mean confidence score across all entities
Processing Time: How long analysis took (typically 50-500ms)
Model Used: Which model performed the analysis

Step 5: Export Your Results

Save your analysis results in multiple formats:

📄 JSON Export

Complete analysis data in machine-readable format

Includes: All entities, statistics, metadata, original text, timestamps

Best for: API integration, data processing, archival

📊 CSV Export

Entity data in spreadsheet-compatible format

Includes: Entity ID, type, text, confidence, position

Best for: Excel analysis, data manipulation, reporting

Tips & Best Practices

✅

For Best Results

•Use clear, well-formatted medical text
•Enable Enhanced Extraction for scanned documents
•Choose the model that matches your content (e.g., Disease model for diagnoses)
•Start with default 65% confidence and adjust based on results
•Review highlighted text view to verify entity accuracy

⚡

Performance Tips

•Shorter texts (< 5000 characters) process faster
•Core models load instantly - no wait time
•Optional models take 2-5 seconds to load first time
•File upload extracts text first, then analyzes (may take a few seconds)

🎯

Accuracy Tips

•Higher confidence threshold = more accurate but fewer results
•For exploratory analysis, try 50-60% threshold
•For production/clinical use, use 70-80% threshold
•Check multiple models if you need comprehensive entity detection
•Review entities in highlighted text to catch context-dependent meanings

💡

Use Case Examples

•Clinical notes: Use Disease + Anatomy + Pharma models
•Research papers: Use Genomic + Protein models
•De-identification: Use PII Detection model
•Lab reports: Use Pathology + Chemical models
•Drug safety: Use Pharma + Disease models

Troubleshooting

❓ File upload fails

Try these solutions:

→Check file size is under 10MB
→Ensure file format is PDF, DOCX, or TXT
→Try enabling Enhanced Extraction for scanned PDFs
→For very large files, try pasting text directly instead

❓ No entities detected

Try these solutions:

→Lower the confidence threshold to 40-50%
→Try a different model that matches your content
→Check if text contains medical terminology
→Verify text extracted correctly from uploaded file

❓ Too many false positives

Try these solutions:

→Increase confidence threshold to 70-80%
→Review entities in highlighted text view
→Use more specific model (e.g., Pharma instead of Disease)
→Filter by entity type in table view

❓ Analysis is slow

Try these solutions:

→Check if you're using Enhanced Extraction (takes 2-5s per page)
→Large files take longer to process
→First-time loading optional models adds 2-5 seconds
→Backend may be loading models on first startup (15-20 seconds)

Try It with Sample Text

Sample Medical Note

Patient presents with chest pain and shortness of breath. Past medical history includes hypertension and type 2 diabetes mellitus. Current medications: Metformin 1000mg twice daily, Lisinopril 20mg once daily, Aspirin 81mg daily. Vital signs: BP 145/92 mmHg, HR 88 bpm. Laboratory results: HbA1c 7.8%, LDL 145 mg/dL, Troponin elevated at 0.8 ng/mL. EKG shows ST elevation in leads II, III, aVF. Diagnosis: Acute inferior myocardial infarction. Plan: Cardiac catheterization, continue antiplatelet therapy.

Try analyzing this text with different models:

Disease model: Will find "hypertension", "type 2 diabetes mellitus", "myocardial infarction"
Pharma model: Will find "Metformin", "Lisinopril", "Aspirin"
Anatomy model: Will find "chest", mentions of heart-related structures
Pathology model: Will find "ST elevation", "Troponin elevated"

Need More Help?

Check out our comprehensive documentation for technical details, API reference, and advanced usage.