satyaki-mitra committed on
Commit
69256da
·
1 Parent(s): fead05e

Updated UI

.env.example CHANGED
@@ -0,0 +1,16 @@
1
+ # API Configuration
2
+ HOST=0.0.0.0
3
+ PORT=7860
4
+ DEBUG=False
5
+ ENVIRONMENT=production
6
+ WORKERS=2
7
+
8
+ # Model Configuration
9
+ HF_TOKEN=your_huggingface_token_here
10
+ OFFLINE_MODE=False
11
+
12
+ # CORS
13
+ CORS_ORIGINS=*
14
+
15
+ # Logging
16
+ LOG_LEVEL=INFO
.gitignore CHANGED
@@ -21,6 +21,10 @@ wheels/
21
  .installed.cfg
22
  *.egg
23
 
24
  # Virtual environments
25
  venv/
26
  env/
@@ -36,14 +40,7 @@ ENV/
36
  .DS_Store
37
  Thumbs.db
38
 
39
- # Logs
40
- logs/
41
- *.log
42
-
43
- # Data files (if you have large datasets)
44
- data/
45
- models/cache/
46
 
47
  # Environment variables
48
  .env
49
- .env.local
 
21
  .installed.cfg
22
  *.egg
23
 
24
+ # models
25
+ models/cache
26
+
27
+
28
  # Virtual environments
29
  venv/
30
  env/
 
40
  .DS_Store
41
  Thumbs.db
42
 
43
 
44
  # Environment variables
45
  .env
46
+ railway.toml
.spacesignore ADDED
@@ -0,0 +1,23 @@
1
+ __pycache__/
2
+ *.pyc
3
+ *.pyo
4
+ *.pyd
5
+ .Python
6
+ *.so
7
+ *.egg
8
+ *.egg-info/
9
+ dist/
10
+ build/
11
+ .git/
12
+ .env
13
+ .venv
14
+ venv/
15
+ *.log
16
+ .DS_Store
17
+ .idea/
18
+ .vscode/
19
+ *.swp
20
+ *.swo
21
+ data/uploads/*
22
+ data/reports/*
23
+ logs/*
Dockerfile ADDED
@@ -0,0 +1,47 @@
1
+ FROM python:3.10-slim
2
+
3
+ # Set working directory
4
+ WORKDIR /app
5
+
6
+ # Install system dependencies
7
+ RUN apt-get update && apt-get install -y \
8
+ build-essential \
9
+ curl \
10
+ git \
11
+ && rm -rf /var/lib/apt/lists/*
12
+
13
+
14
+ # Set environment variables
15
+ ENV PYTHONUNBUFFERED=1 \
16
+ PYTHONDONTWRITEBYTECODE=1 \
17
+ HF_HOME=/tmp/huggingface \
18
+ TRANSFORMERS_CACHE=/tmp/transformers \
19
+ HF_DATASETS_CACHE=/tmp/datasets \
20
+ TOKENIZERS_PARALLELISM=false
21
+
22
+
23
+ # Create necessary directories
24
+ RUN mkdir -p /tmp/huggingface /tmp/transformers /tmp/datasets /app/data/reports /app/data/uploads /app/models/cache /app/logs
25
+
26
+ # Copy requirements first for better caching
27
+ COPY requirements.txt .
28
+
29
+ # Install Python dependencies
30
+ RUN pip install --no-cache-dir -r requirements.txt
31
+
32
+ # Copy application code
33
+ COPY . .
34
+
35
+ # Clear any incompatible cached models
36
+ RUN rm -rf /tmp/huggingface/* /tmp/transformers/* /app/models/cache/*
37
+
38
+ # Expose port 7860 (Hugging Face Spaces standard)
39
+ EXPOSE 7860
40
+
41
+ # Health check
42
+ HEALTHCHECK --interval=30s --timeout=10s --start-period=60s --retries=5 \
43
+ CMD curl -f http://localhost:7860/health || exit 1
44
+
45
+
46
+ # Run the application
47
+ CMD ["uvicorn", "text_auth_app:app", "--host", "0.0.0.0", "--port", "7860"]
README.md CHANGED
@@ -1,3 +1,15 @@
1
  <div align="center">
2
 
3
  # 🔍 AI Text Authentication Platform
@@ -61,13 +73,51 @@ This README is research‑grade (detailed math, methodology, and benchmarks) whi
61
 
62
  | Feature | Description | Impact |
63
  |---|---:|---|
64
- | **Domain‑Aware Detection** | Per‑domain thresholding and weight tuning (academic, technical, creative, social) | ↑15–20% accuracy vs generic detectors |
65
  | **6‑Metric Ensemble** | Orthogonal signals across statistical, syntactic and semantic dimensions | Low false positives (≈2–3%) |
66
  | **Explainability** | Sentence‑level scoring, highlights, and human‑readable reasoning | Trust & auditability |
67
  | **Model Attribution** | Likely model identification (GPT‑4, Claude, Gemini, LLaMA, etc.) | Forensic insights |
68
  | **Auto Model Fetch** | First‑run download from Hugging Face, local cache, offline fallback | Lightweight repo & reproducible runs |
69
  | **Extensible Design** | Plug‑in metrics, model registry, and retraining pipeline hooks | Easy research iteration |
70
 
71
  ---
72
 
73
  ## 🏗️ System Architecture
 
1
+ ---
2
+ title: Text Authentication Platform
3
+ emoji: 🔍
4
+ colorFrom: blue
5
+ colorTo: purple
6
+ sdk: docker
7
+ sdk_version: "4.36.0"
8
+ app_file: text_auth_app.py
9
+ pinned: false
10
+ license: mit
11
+ ---
12
+
13
  <div align="center">
14
 
15
  # 🔍 AI Text Authentication Platform
 
73
 
74
  | Feature | Description | Impact |
75
  |---|---:|---|
76
+ | **Domain‑Aware Detection** | Calibrated thresholds and metric weights for 16 content types (Academic, Technical, Creative, Social Media, etc.) | ↑15–20% accuracy vs generic detectors |
77
  | **6‑Metric Ensemble** | Orthogonal signals across statistical, syntactic and semantic dimensions | Low false positives (≈2–3%) |
78
  | **Explainability** | Sentence‑level scoring, highlights, and human‑readable reasoning | Trust & auditability |
79
  | **Model Attribution** | Likely model identification (GPT‑4, Claude, Gemini, LLaMA, etc.) | Forensic insights |
80
  | **Auto Model Fetch** | First‑run download from Hugging Face, local cache, offline fallback | Lightweight repo & reproducible runs |
81
  | **Extensible Design** | Plug‑in metrics, model registry, and retraining pipeline hooks | Easy research iteration |
82
 
83
+ ### 📊 Supported Domains & Threshold Configuration
84
+
85
+ The platform supports detection tailored to the following 16 domains, each with specific AI/Human probability thresholds and metric weights defined in `config/threshold_config.py`. These configurations are used by the ensemble classifier to adapt its decision-making process.
86
+
87
+ **Domains:**
88
+
89
+ * `general` (Default fallback)
90
+ * `academic`
91
+ * `creative`
92
+ * `ai_ml`
93
+ * `software_dev`
94
+ * `technical_doc`
95
+ * `engineering`
96
+ * `science`
97
+ * `business`
98
+ * `legal`
99
+ * `medical`
100
+ * `journalism`
101
+ * `marketing`
102
+ * `social_media`
103
+ * `blog_personal`
104
+ * `tutorial`
105
+
106
+ **Threshold Configuration Details (`config/threshold_config.py`):**
107
+
108
+ Each domain is configured with specific thresholds for the six detection metrics and an ensemble threshold. The weights determine the relative importance of each metric's output during the ensemble aggregation phase; a sketch of one possible entry shape follows the list below.
109
+
110
+ * **AI Threshold:** If a metric's AI probability exceeds this value, it leans towards an "AI" classification for that metric.
111
+ * **Human Threshold:** If a metric's AI probability falls below this value, it leans towards a "Human" classification for that metric.
112
+ * **Weight:** The relative weight assigned to the metric's result during ensemble combination (normalized internally to sum to 1.0 for active metrics).
113
+
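+ For illustration only, the entry for one domain might be shaped as follows. The field names are assumptions made for readability, not the authoritative schema; `config/threshold_config.py` remains the source of truth.
+
+ ```python
+ # Hypothetical shape of one domain's configuration; field names are
+ # illustrative and may differ from config/threshold_config.py.
+ ACADEMIC_CONFIG = {
+     "metrics": {
+         "perplexity": {"ai_threshold": 0.70, "human_threshold": 0.35, "weight": 0.25},
+         "structural": {"ai_threshold": 0.65, "human_threshold": 0.40, "weight": 0.15},
+         # ... one entry per detection metric
+     },
+     "ensemble_threshold": 0.60,
+ }
+ ```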
114
+ ### Confidence-Calibrated Aggregation (High Level)
115
+
116
+ 1. Start with domain-specific base weights (defined in `config/threshold_config.py`).
117
+ 2. Adjust these weights dynamically based on each metric's individual confidence score using a scaling function.
118
+ 3. Normalize the adjusted weights.
119
+ 4. Compute the final weighted aggregate probability (see the sketch below).
120
+
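+ A minimal sketch of these four steps, assuming each metric's AI probability and confidence are already computed (function and variable names are illustrative, not the platform's actual API):
+
+ ```python
+ # Hedged sketch of confidence-calibrated aggregation; names are illustrative.
+ def aggregate(probs, confidences, base_weights):
+     # Steps 1-2: scale each domain base weight by the metric's confidence
+     adjusted = {m: base_weights[m] * confidences[m] for m in probs}
+     # Step 3: normalize the adjusted weights so they sum to 1.0
+     total = sum(adjusted.values()) or 1.0
+     norm = {m: w / total for m, w in adjusted.items()}
+     # Step 4: weighted aggregate AI probability
+     return sum(probs[m] * norm[m] for m in probs)
+
+ # Example: the more confident metric dominates the aggregate (~0.73)
+ print(aggregate({"perplexity": 0.8, "structural": 0.4},
+                 {"perplexity": 0.9, "structural": 0.3},
+                 {"perplexity": 0.6, "structural": 0.4}))
+ ```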
121
  ---
122
 
123
  ## 🏗️ System Architecture
config/model_config.py CHANGED
@@ -20,6 +20,8 @@ class ModelType(Enum):
20
  EMBEDDING = "embedding"
21
  RULE_BASED = "rule_based"
22
  SEQUENCE_CLASSIFICATION = "sequence_classification"
 
 
23
 
24
 
25
  @dataclass
@@ -99,7 +101,7 @@ MODEL_REGISTRY : Dict[str, ModelConfig] = {"perplexity_gpt2" : ModelC
99
  quantizable = True,
100
  ),
101
  "multi_perturbation_base" : ModelConfig(model_id = "gpt2",
102
- model_type = ModelType.GPTMASK,
103
  description = "MultiPerturbationStability model (reuses gpt2)",
104
  size_mb = 0,
105
  required = True,
@@ -108,7 +110,7 @@ MODEL_REGISTRY : Dict[str, ModelConfig] = {"perplexity_gpt2" : ModelC
108
  batch_size = 4,
109
  ),
110
  "multi_perturbation_mask" : ModelConfig(model_id = "distilroberta-base",
111
- model_type = ModelType.TRANSFORMER,
112
  description = "Masked LM for text perturbation",
113
  size_mb = 330,
114
  required = True,
 
20
  EMBEDDING = "embedding"
21
  RULE_BASED = "rule_based"
22
  SEQUENCE_CLASSIFICATION = "sequence_classification"
23
+ CAUSAL_LM = "causal_lm"
24
+ MASKED_LM = "masked_lm"
25
 
26
 
27
  @dataclass
 
101
  quantizable = True,
102
  ),
103
  "multi_perturbation_base" : ModelConfig(model_id = "gpt2",
104
+ model_type = ModelType.CAUSAL_LM,
105
  description = "MultiPerturbationStability model (reuses gpt2)",
106
  size_mb = 0,
107
  required = True,
 
110
  batch_size = 4,
111
  ),
112
  "multi_perturbation_mask" : ModelConfig(model_id = "distilroberta-base",
113
+ model_type = ModelType.MASKED_LM,
114
  description = "Masked LM for text perturbation",
115
  size_mb = 330,
116
  required = True,
data/reports/file_1762499114477_20251107_123724.pdf ADDED
@@ -0,0 +1,137 @@
+ [Binary content omitted: 4-page PDF report generated with ReportLab (font/page objects and compressed content streams)]
detector/attribution.py CHANGED
@@ -77,7 +77,7 @@ class ModelAttributor:
77
  - Confidence-weighted aggregation
78
  - Explainable reasoning
79
  """
80
- # DOCUMENT-ALIGNED: Metric weights from technical specification
81
  METRIC_WEIGHTS = {"perplexity" : 0.25,
82
  "structural" : 0.15,
83
  "semantic_analysis" : 0.15,
@@ -86,7 +86,7 @@ class ModelAttributor:
86
  "multi_perturbation_stability" : 0.10,
87
  }
88
 
89
- # DOMAIN-AWARE model patterns for ALL 16 DOMAINS
90
  DOMAIN_MODEL_PREFERENCES = {Domain.GENERAL : [AIModel.GPT_4, AIModel.CLAUDE_3_SONNET, AIModel.GEMINI_PRO, AIModel.GPT_3_5],
91
  Domain.ACADEMIC : [AIModel.GPT_4, AIModel.CLAUDE_3_OPUS, AIModel.GEMINI_ULTRA, AIModel.GPT_4_TURBO],
92
  Domain.TECHNICAL_DOC : [AIModel.GPT_4_TURBO, AIModel.CLAUDE_3_SONNET, AIModel.LLAMA_3, AIModel.GPT_4],
@@ -105,7 +105,7 @@ class ModelAttributor:
105
  Domain.TUTORIAL : [AIModel.GPT_4, AIModel.CLAUDE_3_SONNET, AIModel.GEMINI_PRO, AIModel.GPT_4_TURBO],
106
  }
107
 
108
- # Enhanced Model-specific fingerprints with comprehensive patterns
109
  MODEL_FINGERPRINTS = {AIModel.GPT_3_5 : {"phrases" : ["as an ai language model",
110
  "i don't have personal opinions",
111
  "it's important to note that",
@@ -460,13 +460,13 @@ class ModelAttributor:
460
  domain = domain,
461
  )
462
 
463
- # Domain-aware prediction - FIXED: Always show the actual highest probability model
464
  predicted_model, confidence = self._make_domain_aware_prediction(combined_scores = combined_scores,
465
  domain = domain,
466
  domain_preferences = domain_preferences,
467
  )
468
 
469
- # Reasoning with domain context - FIXED
470
  reasoning = self._generate_detailed_reasoning(predicted_model = predicted_model,
471
  confidence = confidence,
472
  domain = domain,
@@ -490,7 +490,7 @@ class ModelAttributor:
490
 
491
  def _calculate_fingerprint_scores(self, text: str, domain: Domain) -> Dict[AIModel, float]:
492
  """
493
- Calculate fingerprint match scores with DOMAIN CALIBRATION - FIXED for all domains
494
  """
495
  scores = {model: 0.0 for model in AIModel if model not in [AIModel.HUMAN, AIModel.UNKNOWN]}
496
 
@@ -812,7 +812,7 @@ class ModelAttributor:
812
 
813
  def _make_domain_aware_prediction(self, combined_scores: Dict[str, float], domain: Domain, domain_preferences: List[AIModel]) -> Tuple[AIModel, float]:
814
  """
815
- Domain aware prediction that considers domain-specific model preferences - FIXED
816
  """
817
  if not combined_scores:
818
  return AIModel.UNKNOWN, 0.0
@@ -825,109 +825,103 @@ class ModelAttributor:
825
 
826
  best_model_name, best_score = sorted_models[0]
827
 
828
- # FIXED: Only return UNKNOWN if the best score is very low
829
- # Use a more reasonable threshold for attribution
830
- if best_score < 0.05: # Changed from 0.08 to 0.05 to be less restrictive
831
  return AIModel.UNKNOWN, best_score
832
 
833
- # FIXED: Don't override with domain preferences if there's a clear winner
834
- # Only consider domain preferences if scores are very close
835
- if len(sorted_models) > 1:
836
- second_model_name, second_score = sorted_models[1]
837
- score_difference = best_score - second_score
838
-
839
- # If scores are very close (within 3%) and second is domain-preferred, consider it
840
- if score_difference < 0.03:
841
- try:
842
- best_model = AIModel(best_model_name)
843
- second_model = AIModel(second_model_name)
844
-
845
- # If second model is domain-preferred and first is not, prefer second
846
- if (second_model in domain_preferences and
847
- best_model not in domain_preferences):
848
- best_model_name = second_model_name
849
- best_score = second_score
850
- except ValueError:
851
- pass
852
-
853
  try:
854
  best_model = AIModel(best_model_name)
 
855
  except ValueError:
856
  best_model = AIModel.UNKNOWN
857
 
858
- # Calculate confidence based on score dominance
859
- if len(sorted_models) > 1:
860
  second_score = sorted_models[1][1]
861
- margin = best_score - second_score
862
- # Confidence based on both absolute score and margin
863
- confidence = min(1.0, best_score * 0.6 + margin * 2.0)
 
864
  else:
865
- confidence = best_score * 0.7
866
 
867
- # FIXED: Don't downgrade to UNKNOWN based on confidence alone
868
- # If we have a model with reasonable probability, show it even with low confidence
869
- return best_model, confidence
870
 
871
 
872
  def _generate_detailed_reasoning(self, predicted_model: AIModel, confidence: float, domain: Domain, metric_contributions: Dict[str, float],
873
  combined_scores: Dict[str, float]) -> List[str]:
874
  """
875
- Generate Explainable reasoning - FIXED to show proper formatting
876
  """
877
  reasoning = []
878
 
879
  reasoning.append("**AI Model Attribution Analysis**")
880
  reasoning.append("")
881
- reasoning.append(f"**Domain**: {domain.value.replace('_', ' ').title()}")
882
- reasoning.append("")
883
 
884
  # Show prediction with confidence
885
  if (predicted_model == AIModel.UNKNOWN):
886
  reasoning.append("**Most Likely**: Unable to determine with high confidence")
887
- reasoning.append("")
888
- reasoning.append("**Top Candidates:**")
889
-
890
  else:
891
  model_name = predicted_model.value.replace("-", " ").replace("_", " ").title()
892
  reasoning.append(f"**Predicted Model**: {model_name}")
893
  reasoning.append(f"**Confidence**: {confidence*100:.1f}%")
894
- reasoning.append("")
895
- reasoning.append("**Model Probability Distribution:**")
896
897
  reasoning.append("")
898
 
899
- # Show top candidates in proper format
900
  if combined_scores:
901
  sorted_models = sorted(combined_scores.items(), key = lambda x: x[1], reverse = True)
902
 
903
  for i, (model_name, score) in enumerate(sorted_models[:6]):
904
- # Skip very low probability models
905
  if (score < 0.01):
906
  continue
907
-
908
  display_name = model_name.replace("-", " ").replace("_", " ").title()
909
  percentage = score * 100
910
 
911
- # Single line format: "• Model Name: XX.X%"
912
  reasoning.append(f"• **{display_name}**: {percentage:.1f}%")
913
 
914
  reasoning.append("")
915
 
916
- # Domain-specific insights - FIXED: Removed duplicate header
917
  reasoning.append("**Analysis Notes:**")
918
- reasoning.append(f"• Calibrated for {domain.value.replace('_', ' ')} domain")
919
-
920
- if (domain in [Domain.ACADEMIC, Domain.TECHNICAL_DOC, Domain.AI_ML, Domain.SOFTWARE_DEV, Domain.ENGINEERING, Domain.SCIENCE]):
921
- reasoning.append("• Higher weight on structural coherence and technical patterns")
922
 
923
- elif (domain in [Domain.CREATIVE, Domain.MARKETING, Domain.SOCIAL_MEDIA, Domain.BLOG_PERSONAL]):
924
- reasoning.append("• Emphasis on linguistic diversity and stylistic variation")
925
-
926
- elif (domain in [Domain.LEGAL, Domain.MEDICAL]):
927
- reasoning.append("• Focus on formal language and specialized terminology")
928
-
929
- elif (domain in [Domain.BUSINESS, Domain.JOURNALISM, Domain.TUTORIAL]):
930
- reasoning.append("• Balanced analysis across multiple attribution factors")
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
931
 
932
  return reasoning
933
 
 
77
  - Confidence-weighted aggregation
78
  - Explainable reasoning
79
  """
80
+ # Metric weights from technical specification
81
  METRIC_WEIGHTS = {"perplexity" : 0.25,
82
  "structural" : 0.15,
83
  "semantic_analysis" : 0.15,
 
86
  "multi_perturbation_stability" : 0.10,
87
  }
88
 
89
+ # Domain-aware model patterns for ALL 16 DOMAINS
90
  DOMAIN_MODEL_PREFERENCES = {Domain.GENERAL : [AIModel.GPT_4, AIModel.CLAUDE_3_SONNET, AIModel.GEMINI_PRO, AIModel.GPT_3_5],
91
  Domain.ACADEMIC : [AIModel.GPT_4, AIModel.CLAUDE_3_OPUS, AIModel.GEMINI_ULTRA, AIModel.GPT_4_TURBO],
92
  Domain.TECHNICAL_DOC : [AIModel.GPT_4_TURBO, AIModel.CLAUDE_3_SONNET, AIModel.LLAMA_3, AIModel.GPT_4],
 
105
  Domain.TUTORIAL : [AIModel.GPT_4, AIModel.CLAUDE_3_SONNET, AIModel.GEMINI_PRO, AIModel.GPT_4_TURBO],
106
  }
107
 
108
+ # Model-specific fingerprints with comprehensive patterns
109
  MODEL_FINGERPRINTS = {AIModel.GPT_3_5 : {"phrases" : ["as an ai language model",
110
  "i don't have personal opinions",
111
  "it's important to note that",
 
460
  domain = domain,
461
  )
462
 
463
+ # Domain-aware prediction: always show the actual highest-probability model
464
  predicted_model, confidence = self._make_domain_aware_prediction(combined_scores = combined_scores,
465
  domain = domain,
466
  domain_preferences = domain_preferences,
467
  )
468
 
469
+ # Reasoning with domain context
470
  reasoning = self._generate_detailed_reasoning(predicted_model = predicted_model,
471
  confidence = confidence,
472
  domain = domain,
 
490
 
491
  def _calculate_fingerprint_scores(self, text: str, domain: Domain) -> Dict[AIModel, float]:
492
  """
493
+ Calculate fingerprint match scores with domain calibration across all domains
494
  """
495
  scores = {model: 0.0 for model in AIModel if model not in [AIModel.HUMAN, AIModel.UNKNOWN]}
496
 
 
812
 
813
  def _make_domain_aware_prediction(self, combined_scores: Dict[str, float], domain: Domain, domain_preferences: List[AIModel]) -> Tuple[AIModel, float]:
814
  """
815
+ Domain-aware prediction that considers domain-specific model preferences
816
  """
817
  if not combined_scores:
818
  return AIModel.UNKNOWN, 0.0
 
825
 
826
  best_model_name, best_score = sorted_models[0]
827
 
828
+ # Return UNKNOWN only when the best score is negligible
829
+ if (best_score < 0.01):
 
830
  return AIModel.UNKNOWN, best_score
831
 
832
  try:
833
  best_model = AIModel(best_model_name)
834
+
835
  except ValueError:
836
  best_model = AIModel.UNKNOWN
837
 
838
+ # Confidence from the absolute score plus the margin over the runner-up
839
+ if (len(sorted_models) > 1):
840
  second_score = sorted_models[1][1]
841
+ margin = best_score - second_score
842
+ # Blend absolute score with the margin, capped at 1.0
843
+ confidence = min(1.0, best_score * 0.8 + margin * 1.5)
844
+
845
  else:
846
+ confidence = best_score * 0.9
847
 
848
+ # Always return the actual best model, never downgrade to UNKNOWN
849
+ return best_model, max(0.05, confidence)
 
850
 
851
 
852
  def _generate_detailed_reasoning(self, predicted_model: AIModel, confidence: float, domain: Domain, metric_contributions: Dict[str, float],
853
  combined_scores: Dict[str, float]) -> List[str]:
854
  """
855
+ Generate explainable reasoning for the attribution result
856
  """
857
  reasoning = []
858
 
859
  reasoning.append("**AI Model Attribution Analysis**")
860
  reasoning.append("")
 
 
861
 
862
  # Show prediction with confidence
863
  if (predicted_model == AIModel.UNKNOWN):
864
  reasoning.append("**Most Likely**: Unable to determine with high confidence")
865
+
 
 
866
  else:
867
  model_name = predicted_model.value.replace("-", " ").replace("_", " ").title()
868
  reasoning.append(f"**Predicted Model**: {model_name}")
869
  reasoning.append(f"**Confidence**: {confidence*100:.1f}%")
 
 
870
 
871
+ reasoning.append(f"**Domain**: {domain.value.replace('_', ' ').title()}")
872
+ reasoning.append("")
873
+
874
+ # Show model probability distribution
875
+ reasoning.append("**Model Probability Distribution:**")
876
  reasoning.append("")
877
 
 
878
  if combined_scores:
879
  sorted_models = sorted(combined_scores.items(), key = lambda x: x[1], reverse = True)
880
 
881
  for i, (model_name, score) in enumerate(sorted_models[:6]):
882
+ # Skip very low probabilities
883
  if (score < 0.01):
884
  continue
885
+
886
  display_name = model_name.replace("-", " ").replace("_", " ").title()
887
  percentage = score * 100
888
 
889
+ # Use proper markdown formatting
890
  reasoning.append(f"• **{display_name}**: {percentage:.1f}%")
891
 
892
  reasoning.append("")
893
 
894
+ # Add analysis insights
895
  reasoning.append("**Analysis Notes:**")
 
 
 
 
896
 
897
+ if (confidence < 0.3):
898
+ reasoning.append("• Low confidence attribution - text patterns are ambiguous")
899
+ reasoning.append("• May be human-written or from multiple AI sources")
900
+
901
+ else:
902
+ reasoning.append(f"• Calibrated for {domain.value.replace('_', ' ')} domain")
903
+
904
+ # Domain-specific insights
905
+ domain_insights = {Domain.ACADEMIC : "Academic writing patterns analyzed",
906
+ Domain.TECHNICAL_DOC : "Technical coherence and structure weighted",
907
+ Domain.CREATIVE : "Stylistic and linguistic diversity emphasized",
908
+ Domain.SOCIAL_MEDIA : "Casual language and engagement patterns considered",
909
+ Domain.AI_ML : "Technical terminology and analytical patterns emphasized",
910
+ Domain.SOFTWARE_DEV : "Code-like structures and technical precision weighted",
911
+ Domain.ENGINEERING : "Technical specifications and formal language analyzed",
912
+ Domain.SCIENCE : "Scientific terminology and methodological patterns considered",
913
+ Domain.BUSINESS : "Professional communication and strategic language weighted",
914
+ Domain.LEGAL : "Formal language and legal terminology emphasized",
915
+ Domain.MEDICAL : "Medical terminology and clinical language analyzed",
916
+ Domain.JOURNALISM : "News reporting style and factual presentation weighted",
917
+ Domain.MARKETING : "Persuasive language and engagement patterns considered",
918
+ Domain.BLOG_PERSONAL : "Personal voice and conversational style analyzed",
919
+ Domain.TUTORIAL : "Instructional clarity and step-by-step structure weighted",
920
+ }
921
+
922
+ insight = domain_insights.get(domain, "Multiple attribution factors analyzed")
923
+
924
+ reasoning.append(f"• {insight}")
925
 
926
  return reasoning
927
 
detector/highlighter.py CHANGED
@@ -48,14 +48,14 @@ class TextHighlighter:
48
  - Explainable tooltips
49
  - Highlighting metrics calculation
50
  """
51
- # Color thresholds with mixed content support
52
  COLOR_THRESHOLDS = [(0.00, 0.10, "very-high-human", "#dcfce7", "Very likely human-written"),
53
  (0.10, 0.25, "high-human", "#bbf7d0", "Likely human-written"),
54
  (0.25, 0.40, "medium-human", "#86efac", "Possibly human-written"),
55
  (0.40, 0.60, "uncertain", "#fef9c3", "Uncertain"),
56
  (0.60, 0.75, "medium-ai", "#fde68a", "Possibly AI-generated"),
57
  (0.75, 0.90, "high-ai", "#fed7aa", "Likely AI-generated"),
58
- (0.90, 1.01, "very-high-ai", "#fecaca", "Very likely AI-generated"),
59
  ]
60
 
61
  # Mixed content pattern
@@ -86,11 +86,23 @@ class TextHighlighter:
86
  self.text_processor = TextProcessor()
87
  self.domain = domain
88
  self.domain_thresholds = get_threshold_for_domain(domain)
89
- self.ensemble = ensemble_classifier or EnsembleClassifier(primary_method = "confidence_calibrated",
90
- fallback_method = "domain_weighted",
91
- )
92
 
93
 
94
  def generate_highlights(self, text: str, metric_results: Dict[str, MetricResult], ensemble_result: Optional[EnsembleResult] = None,
95
  enabled_metrics: Optional[Dict[str, bool]] = None, use_sentence_level: bool = True) -> List[HighlightedSentence]:
96
  """
@@ -112,80 +124,197 @@ class TextHighlighter:
112
  --------
113
  { list } : List of HighlightedSentence objects
114
  """
115
- # Get domain-appropriate weights for enabled metrics
116
- if enabled_metrics is None:
117
- enabled_metrics = {name: True for name in metric_results.keys()}
118
-
119
- weights = get_active_metric_weights(self.domain, enabled_metrics)
120
-
121
- # Split text into sentences
122
- sentences = self._split_sentences(text)
 
123
 
124
- if not sentences:
125
- return []
 
126
 
127
- # Calculate probabilities for each sentence using ENSEMBLE METHODS
128
- highlighted_sentences = list()
129
-
130
- for idx, sentence in enumerate(sentences):
131
- if use_sentence_level:
132
- # Use ENSEMBLE for sentence-level analysis
133
- ai_prob, human_prob, mixed_prob, confidence, breakdown = self._calculate_sentence_ensemble_probability(sentence = sentence,
134
- metric_results = metric_results,
135
- weights = weights,
136
- ensemble_result = ensemble_result,
137
- )
138
- else:
139
- # Use document-level ensemble probabilities
140
- ai_prob, human_prob, mixed_prob, confidence, breakdown = self._get_document_ensemble_probability(ensemble_result = ensemble_result,
141
- metric_results = metric_results,
142
- weights = weights,
143
- )
 
144
 
145
- # Apply domain-specific adjustments
146
- ai_prob = self._apply_domain_specific_adjustments(sentence, ai_prob, len(sentence.split()))
147
 
148
- # Determine if this is mixed content
149
- is_mixed_content = (mixed_prob > self.MIXED_THRESHOLD)
 
150
 
151
- # Get confidence level
152
- confidence_level = get_confidence_level(confidence)
153
 
154
- # Get color class (consider mixed content)
155
- color_class, color_hex, tooltip_base = self._get_color_for_probability(probability = ai_prob,
156
- is_mixed_content = is_mixed_content,
157
- mixed_prob = mixed_prob,
158
- )
159
 
160
- # Generate enhanced tooltip
161
- tooltip = self._generate_ensemble_tooltip(sentence = sentence,
162
- ai_prob = ai_prob,
163
- human_prob = human_prob,
164
- mixed_prob = mixed_prob,
165
- confidence = confidence,
166
- confidence_level = confidence_level,
167
- tooltip_base = tooltip_base,
168
- breakdown = breakdown,
169
- is_mixed_content = is_mixed_content,
170
- )
171
 
172
- highlighted_sentences.append(HighlightedSentence(text = sentence,
173
- ai_probability = ai_prob,
174
- human_probability = human_prob,
175
- mixed_probability = mixed_prob,
176
- confidence = confidence,
177
- confidence_level = confidence_level,
178
- color_class = color_class,
179
- tooltip = tooltip,
180
- index = idx,
181
- is_mixed_content = is_mixed_content,
182
- metric_breakdown = breakdown,
183
- )
184
- )
185
-
186
- return highlighted_sentences
187
 
188
-
189
  def _calculate_sentence_ensemble_probability(self, sentence: str, metric_results: Dict[str, MetricResult], weights: Dict[str, float],
190
  ensemble_result: Optional[EnsembleResult] = None) -> Tuple[float, float, float, float, Dict[str, float]]:
191
  """
@@ -193,10 +322,24 @@ class TextHighlighter:
193
  """
194
  sentence_length = len(sentence.split())
195
 
196
- # IMPROVED: Better handling of short sentences
197
  if (sentence_length < 3):
198
- # Return neutral probability for very short sentences with low confidence
199
- return 0.5, 0.5, 0.0, 0.3, {"short_sentence": 0.5}
 
 
 
 
 
 
 
 
 
 
 
 
 
 
200
 
201
  # Calculate sentence-level metric results
202
  sentence_metric_results = dict()
@@ -204,20 +347,27 @@ class TextHighlighter:
204
 
205
  for name, doc_result in metric_results.items():
206
  if doc_result.error is None:
207
- # Compute sentence-level probability for this metric
208
- sentence_prob = self._compute_sentence_metric(metric_name = name,
209
- sentence = sentence,
210
- result = doc_result,
211
- weight = weights.get(name, 0.0),
212
- )
213
-
214
- # Create sentence-level MetricResult
215
- sentence_metric_results[name] = self._create_sentence_metric_result(metric_name = name,
216
- ai_prob = sentence_prob,
217
- doc_result = doc_result,
218
- )
 
219
 
220
- breakdown[name] = sentence_prob
 
221
 
222
  # Use ensemble to combine sentence-level metrics
223
  if sentence_metric_results:
@@ -226,8 +376,11 @@ class TextHighlighter:
226
  domain = self.domain,
227
  )
228
 
229
- return (ensemble_sentence_result.ai_probability, ensemble_sentence_result.human_probability, ensemble_sentence_result.mixed_probability,
230
- ensemble_sentence_result.overall_confidence, breakdown)
 
231
 
232
  except Exception as e:
233
  logger.warning(f"Sentence ensemble failed: {e}")
@@ -262,12 +415,12 @@ class TextHighlighter:
262
  return adjusted_prob
263
 
264
 
265
- def _create_sentence_metric_result(self, metric_name: str, ai_prob: float, doc_result: MetricResult) -> MetricResult:
266
  """
267
  Create sentence-level MetricResult from document-level result
268
  """
269
- # Adjust confidence based on sentence characteristics
270
- sentence_confidence = self._calculate_sentence_confidence(doc_result.confidence)
271
 
272
  return MetricResult(metric_name = metric_name,
273
  ai_probability = ai_prob,
@@ -279,12 +432,15 @@ class TextHighlighter:
279
  )
280
 
281
 
282
- def _calculate_sentence_confidence(self, doc_confidence: float) -> float:
283
  """
284
- Calculate confidence for sentence-level analysis
285
  """
286
- # Sentence-level analysis typically has lower confidence
287
- return max(0.1, doc_confidence * 0.8)
 
288
 
289
 
290
  def _calculate_weighted_probability(self, metric_results: Dict[str, MetricResult], weights: Dict[str, float], breakdown: Dict[str, float]) -> Tuple[float, float, float, float, Dict[str, float]]:
@@ -306,8 +462,8 @@ class TextHighlighter:
306
  confidences.append(result.confidence)
307
  total_weight += weight
308
 
309
- if not weighted_ai_probs or total_weight == 0:
310
- return 0.5, 0.5, 0.0, 0.5, {}
311
 
312
  ai_prob = sum(weighted_ai_probs) / total_weight
313
  human_prob = sum(weighted_human_probs) / total_weight
@@ -331,84 +487,94 @@ class TextHighlighter:
331
  else:
332
  # Calculate from metrics
333
  return self._calculate_weighted_probability(metric_results, weights, {})
334
-
335
 
336
  def _apply_domain_specific_adjustments(self, sentence: str, ai_prob: float, sentence_length: int) -> float:
337
  """
338
- Apply domain-specific adjustments to AI probability - UPDATED FOR ALL DOMAINS
339
  """
 
 
340
  sentence_lower = sentence.lower()
341
 
342
  # Technical & AI/ML domains
343
- if self.domain in [Domain.AI_ML, Domain.SOFTWARE_DEV, Domain.TECHNICAL_DOC, Domain.ENGINEERING, Domain.SCIENCE]:
344
  if self._has_technical_terms(sentence_lower):
345
- # Technical terms more common in AI
346
- ai_prob *= 1.1
347
 
348
  elif self._has_code_like_patterns(sentence):
349
- ai_prob *= 1.15
350
 
351
- elif sentence_length > 35:
352
- ai_prob *= 1.05
353
 
354
  # Creative & informal domains
355
- elif self.domain in [Domain.CREATIVE, Domain.SOCIAL_MEDIA, Domain.BLOG_PERSONAL]:
356
  if self._has_informal_language(sentence_lower):
357
- # Informal language more human-like
358
- ai_prob *= 0.7
359
 
360
  elif self._has_emotional_language(sentence):
361
- ai_prob *= 0.8
362
 
363
  elif (sentence_length < 10):
364
- ai_prob *= 0.8
365
 
366
  # Academic & formal domains
367
- elif self.domain in [Domain.ACADEMIC, Domain.LEGAL, Domain.MEDICAL]:
368
  if self._has_citation_patterns(sentence):
369
- # Citations more human-like
370
- ai_prob *= 0.8
371
 
372
  elif self._has_technical_terms(sentence_lower):
373
- ai_prob *= 1.1
374
 
375
  elif (sentence_length > 40):
376
- ai_prob *= 1.1
377
 
378
  # Business & professional domains
379
- elif self.domain in [Domain.BUSINESS, Domain.MARKETING, Domain.JOURNALISM]:
380
  if self._has_business_jargon(sentence_lower):
381
- # Jargon can be AI-like
382
- ai_prob *= 1.05
383
 
384
  elif self._has_ambiguous_phrasing(sentence_lower):
385
- # Ambiguity more human
386
- ai_prob *= 0.9
387
 
388
  elif (15 <= sentence_length <= 25):
389
- ai_prob *= 0.9
390
 
391
  # Tutorial & educational domains
392
  elif (self.domain == Domain.TUTORIAL):
393
  if self._has_instructional_language(sentence_lower):
394
- # Instructional tone more human
395
- ai_prob *= 0.85
396
 
397
  elif self._has_step_by_step_pattern(sentence):
398
- ai_prob *= 0.8
399
 
400
  elif self._has_examples(sentence):
401
- ai_prob *= 0.9
402
 
403
  # General domain - minimal adjustments
404
- elif self.domain == Domain.GENERAL:
405
  if self._has_complex_structure(sentence):
406
- ai_prob *= 0.9
407
 
408
  elif self._has_repetition(sentence):
409
- ai_prob *= 1.1
 
410
 
411
- return max(0.0, min(1.0, ai_prob))
412
 
413
 
414
  def _apply_metric_specific_adjustments(self, metric_name: str, sentence: str, base_prob: float, sentence_length: int, thresholds: MetricThresholds) -> float:
@@ -466,8 +632,12 @@ class TextHighlighter:
466
 
467
  def _get_color_for_probability(self, probability: float, is_mixed_content: bool = False, mixed_prob: float = 0.0) -> Tuple[str, str, str]:
468
  """
469
- Get color class with mixed content support
470
 """
471
  # Check mixed content first
472
  if (is_mixed_content and (mixed_prob > self.MIXED_THRESHOLD)):
473
  return "mixed-content", "#e9d5ff", f"Mixed AI/Human content ({mixed_prob:.1%} mixed)"
@@ -477,12 +647,12 @@ class TextHighlighter:
477
  if (min_thresh <= probability < max_thresh):
478
  return color_class, color_hex, tooltip
479
 
480
- # Fallback
481
- return "uncertain", "#fef9c3", "Uncertain"
482
-
483
 
484
  def _generate_ensemble_tooltip(self, sentence: str, ai_prob: float, human_prob: float, mixed_prob: float, confidence: float, confidence_level: ConfidenceLevel,
485
- tooltip_base: str, breakdown: Optional[Dict[str, float]] = None, is_mixed_content: bool = False) -> str:
486
  """
487
  Generate enhanced tooltip with ENSEMBLE information
488
  """
@@ -504,7 +674,7 @@ class TextHighlighter:
504
  for metric, prob in list(breakdown.items())[:4]:
505
  tooltip += f"\n• {metric}: {prob:.1%}"
506
 
507
- tooltip += f"\n\nEnsemble Method: {self.ensemble.primary_method}"
508
 
509
  return tooltip
510
 
@@ -619,7 +789,7 @@ class TextHighlighter:
619
  Analyze sentence complexity (0 = simple, 1 = complex)
620
  """
621
  words = sentence.split()
622
- if len(words) < 5:
623
  return 0.2
624
 
625
  complexity_indicators = ['although', 'because', 'while', 'when', 'if', 'since', 'unless', 'until', 'which', 'that', 'who', 'whom', 'whose', 'and', 'but', 'or', 'yet', 'so', 'however', 'therefore', 'moreover', 'furthermore', 'nevertheless', ',', ';', ':', '—']
@@ -637,7 +807,7 @@ class TextHighlighter:
637
 
638
  clause_indicators = [',', ';', 'and', 'but', 'or', 'because', 'although']
639
  clause_count = sum(1 for indicator in clause_indicators if indicator in sentence.lower())
640
- score += min(0.2, clause_count * 0.05)
641
 
642
  return min(1.0, score)
643
 
@@ -671,7 +841,7 @@ class TextHighlighter:
671
  for sentence in sentences:
672
  clean_sentence = sentence.strip()
673
 
674
- if (len(clean_sentence) >= 10):
675
  filtered_sentences.append(clean_sentence)
676
 
677
  return filtered_sentences
@@ -1002,7 +1172,7 @@ class TextHighlighter:
1002
  total_sentences = len(highlighted_sentences)
1003
 
1004
  # Calculate weighted risk score
1005
- weighted_risk = 0.0
1006
 
1007
  for sent in highlighted_sentences:
1008
  weight = self.RISK_WEIGHTS.get(sent.color_class, 0.4)
 
48
  - Explainable tooltips
49
  - Highlighting metrics calculation
50
  """
51
+ # Color thresholds with mixed content support (contiguous ranges, no gaps)
52
  COLOR_THRESHOLDS = [(0.00, 0.10, "very-high-human", "#dcfce7", "Very likely human-written"),
53
  (0.10, 0.25, "high-human", "#bbf7d0", "Likely human-written"),
54
  (0.25, 0.40, "medium-human", "#86efac", "Possibly human-written"),
55
  (0.40, 0.60, "uncertain", "#fef9c3", "Uncertain"),
56
  (0.60, 0.75, "medium-ai", "#fde68a", "Possibly AI-generated"),
57
  (0.75, 0.90, "high-ai", "#fed7aa", "Likely AI-generated"),
58
+ (0.90, 1.00, "very-high-ai", "#fecaca", "Very likely AI-generated"),
59
  ]
60
 
61
  # Mixed content pattern
 
86
  self.text_processor = TextProcessor()
87
  self.domain = domain
88
  self.domain_thresholds = get_threshold_for_domain(domain)
89
+ self.ensemble = ensemble_classifier or self._create_default_ensemble()
 
 
90
 
91
 
92
+ def _create_default_ensemble(self) -> EnsembleClassifier:
93
+ """
94
+ Create default ensemble classifier with proper error handling
95
+ """
96
+ try:
97
+ return EnsembleClassifier(primary_method = "confidence_calibrated",
98
+ fallback_method = "domain_weighted",
99
+ )
100
+ except Exception as e:
101
+ logger.warning(f"Failed to create default ensemble: {e}. Using fallback mode.")
102
+ # Return a minimal ensemble or raise based on requirements
103
+ return EnsembleClassifier(primary_method = "weighted_average")
104
+
105
+
106
  def generate_highlights(self, text: str, metric_results: Dict[str, MetricResult], ensemble_result: Optional[EnsembleResult] = None,
107
  enabled_metrics: Optional[Dict[str, bool]] = None, use_sentence_level: bool = True) -> List[HighlightedSentence]:
108
  """
 
124
  --------
125
  { list } : List of HighlightedSentence objects
126
  """
127
+ try:
128
+ # Validate inputs
129
+ if not text or not text.strip():
130
+ return self._handle_empty_text(text, metric_results, ensemble_result)
131
+
132
+ # Get domain-appropriate weights for enabled metrics
133
+ if enabled_metrics is None:
134
+ enabled_metrics = {name: True for name in metric_results.keys()}
135
+
136
+ weights = get_active_metric_weights(self.domain, enabled_metrics)
137
+
138
+ # Split text into sentences with error handling
139
+ sentences = self._split_sentences_with_fallback(text)
140
+
141
+ if not sentences:
142
+ return self._handle_no_sentences(text, metric_results, ensemble_result)
143
+
144
+ # Calculate probabilities for each sentence using ENSEMBLE METHODS
145
+ highlighted_sentences = list()
146
+
147
+ for idx, sentence in enumerate(sentences):
148
+ try:
149
+ if use_sentence_level:
150
+ # Use ENSEMBLE for sentence-level analysis
151
+ ai_prob, human_prob, mixed_prob, confidence, breakdown = self._calculate_sentence_ensemble_probability(sentence = sentence,
152
+ metric_results = metric_results,
153
+ weights = weights,
154
+ ensemble_result = ensemble_result,
155
+ )
156
+ else:
157
+ # Use document-level ensemble probabilities
158
+ ai_prob, human_prob, mixed_prob, confidence, breakdown = self._get_document_ensemble_probability(ensemble_result = ensemble_result,
159
+ metric_results = metric_results,
160
+ weights = weights,
161
+ )
162
+
163
+ # Apply domain-specific adjustments with limits
164
+ ai_prob = self._apply_domain_specific_adjustments(sentence = sentence,
165
+ ai_prob = ai_prob,
166
+ sentence_length = len(sentence.split()),
167
+ )
168
+
169
+ # Determine if this is mixed content
170
+ is_mixed_content = (mixed_prob > self.MIXED_THRESHOLD)
171
+
172
+ # Get confidence level
173
+ confidence_level = get_confidence_level(confidence)
174
+
175
+ # Get color class (consider mixed content)
176
+ color_class, color_hex, tooltip_base = self._get_color_for_probability(probability = ai_prob,
177
+ is_mixed_content = is_mixed_content,
178
+ mixed_prob = mixed_prob,
179
+ )
180
+
181
+ # Generate enhanced tooltip
182
+ tooltip = self._generate_ensemble_tooltip(sentence = sentence,
183
+ ai_prob = ai_prob,
184
+ human_prob = human_prob,
185
+ mixed_prob = mixed_prob,
186
+ confidence = confidence,
187
+ confidence_level = confidence_level,
188
+ tooltip_base = tooltip_base,
189
+ breakdown = breakdown,
190
+ is_mixed_content = is_mixed_content,
191
+ )
192
+
193
+ highlighted_sentences.append(HighlightedSentence(text = sentence,
194
+ ai_probability = ai_prob,
195
+ human_probability = human_prob,
196
+ mixed_probability = mixed_prob,
197
+ confidence = confidence,
198
+ confidence_level = confidence_level,
199
+ color_class = color_class,
200
+ tooltip = tooltip,
201
+ index = idx,
202
+ is_mixed_content = is_mixed_content,
203
+ metric_breakdown = breakdown,
204
+ )
205
+ )
206
+
207
+ except Exception as e:
208
+ logger.warning(f"Failed to process sentence {idx}: {e}")
209
+ # Add fallback sentence
210
+ highlighted_sentences.append(self._create_fallback_sentence(sentence, idx))
211
+
212
+ return highlighted_sentences
213
 
214
+ except Exception as e:
215
+ logger.error(f"Highlight generation failed: {e}")
216
+ return self._create_error_fallback(text, metric_results)
217
+
218
+
219
+ def _handle_empty_text(self, text: str, metric_results: Dict[str, MetricResult], ensemble_result: Optional[EnsembleResult]) -> List[HighlightedSentence]:
220
+ """
221
+ Handle empty input text
222
+ """
223
+ if ensemble_result:
224
+ return [self._create_fallback_sentence(text = "No text content",
225
+ index = 0,
226
+ ai_prob = ensemble_result.ai_probability,
227
+ human_prob = ensemble_result.human_probability,
228
+ )
229
+ ]
230
+
231
+ return [self._create_fallback_sentence("No text content", 0)]
232
+
233
+
234
+ def _handle_no_sentences(self, text: str, metric_results: Dict[str, MetricResult], ensemble_result: Optional[EnsembleResult]) -> List[HighlightedSentence]:
235
+ """
236
+ Handle case where no sentences could be extracted
237
+ """
238
+ if (text and (len(text.strip()) > 0)):
239
+ # Treat entire text as one sentence
240
+ return [self._create_fallback_sentence(text.strip(), 0)]
241
+
242
+ return [self._create_fallback_sentence("No processable content", 0)]
243
+
244
+
245
+ def _create_fallback_sentence(self, text: str, index: int, ai_prob: float = 0.5, human_prob: float = 0.5) -> HighlightedSentence:
246
+ """
247
+ Create a fallback sentence when processing fails
248
+ """
249
+ confidence_level = get_confidence_level(0.3)
250
+ color_class, _, tooltip_base = self._get_color_for_probability(probability = ai_prob,
251
+ is_mixed_content = False,
252
+ mixed_prob = 0.0,
253
+ )
254
 
255
+ return HighlightedSentence(text = text,
256
+ ai_probability = ai_prob,
257
+ human_probability = human_prob,
258
+ mixed_probability = 0.0,
259
+ confidence = 0.3,
260
+ confidence_level = confidence_level,
261
+ color_class = color_class,
262
+ tooltip = f"Fallback: {tooltip_base}\nProcessing failed for this sentence",
263
+ index = index,
264
+ is_mixed_content = False,
265
+ metric_breakdown = {"fallback": ai_prob},
266
+ )
267
+
268
+
269
+ def _create_error_fallback(self, text: str, metric_results: Dict[str, MetricResult]) -> List[HighlightedSentence]:
270
+ """
271
+ Create fallback when entire processing fails
272
+ """
273
+ return [HighlightedSentence(text = text[:100] + "..." if len(text) > 100 else text,
274
+ ai_probability = 0.5,
275
+ human_probability = 0.5,
276
+ mixed_probability = 0.0,
277
+ confidence = 0.1,
278
+ confidence_level = get_confidence_level(0.1),
279
+ color_class = "uncertain",
280
+ tooltip = "Error in text processing",
281
+ index = 0,
282
+ is_mixed_content = False,
283
+ metric_breakdown = {"error": 0.5},
284
+ )
285
+ ]
286
+
287
+
288
+ def _split_sentences_with_fallback(self, text: str) -> List[str]:
289
+ """
290
+ Split text into sentences with comprehensive fallback handling
291
+ """
292
+ try:
293
+ sentences = self.text_processor.split_sentences(text)
294
+ filtered_sentences = [s.strip() for s in sentences if len(s.strip()) >= 3]
295
 
296
+ if filtered_sentences:
297
+ return filtered_sentences
298
 
299
+ # Fallback: split by common sentence endings
300
+ fallback_sentences = re.split(r'[.!?]+', text)
301
+ fallback_sentences = [s.strip() for s in fallback_sentences if len(s.strip()) >= 3]
302
 
303
+ if fallback_sentences:
304
+ return fallback_sentences
305
 
306
+ # Ultimate fallback: treat as single sentence if meaningful
307
+ if text.strip():
308
+ return [text.strip()]
 
 
309
 
310
+ return []
 
311
 
312
+ except Exception as e:
313
+ logger.warning(f"Sentence splitting failed, using fallback: {e}")
314
+ # Return text as single sentence
315
+ return [text] if text.strip() else []
316
+
 
317
 
 
318
  def _calculate_sentence_ensemble_probability(self, sentence: str, metric_results: Dict[str, MetricResult], weights: Dict[str, float],
319
  ensemble_result: Optional[EnsembleResult] = None) -> Tuple[float, float, float, float, Dict[str, float]]:
320
  """
 
322
  """
323
  sentence_length = len(sentence.split())
324
 
325
+ # Handle very short sentences without forcing a neutral score
326
  if (sentence_length < 3):
327
+ # Return probabilities with lower confidence for very short sentences
328
+ base_ai_prob = 0.5
329
+
330
+ # Low confidence for very short sentences
331
+ base_confidence = 0.2
332
+
333
+ breakdown = {"short_sentence" : base_ai_prob}
334
+
335
+ # Try to get some signal from available metrics
336
+ for name, result in metric_results.items():
337
+ if ((result.error is None) and (weights.get(name, 0) > 0)):
338
+ base_ai_prob = result.ai_probability
339
+ breakdown[name] = base_ai_prob
340
+ break
341
+
342
+ return base_ai_prob, 1.0 - base_ai_prob, 0.0, base_confidence, breakdown
343
 
344
  # Calculate sentence-level metric results
345
  sentence_metric_results = dict()
 
347
 
348
  for name, doc_result in metric_results.items():
349
  if doc_result.error is None:
350
+ try:
351
+ # Compute sentence-level probability for this metric
352
+ sentence_prob = self._compute_sentence_metric(metric_name = name,
353
+ sentence = sentence,
354
+ result = doc_result,
355
+ weight = weights.get(name, 0.0),
356
+ )
357
+
358
+ # Create sentence-level MetricResult
359
+ sentence_metric_results[name] = self._create_sentence_metric_result(metric_name = name,
360
+ ai_prob = sentence_prob,
361
+ doc_result = doc_result,
362
+ sentence_length = sentence_length,
363
+ )
364
+
365
+ breakdown[name] = sentence_prob
366
 
367
+ except Exception as e:
368
+ logger.warning(f"Metric {name} failed for sentence: {e}")
369
+ # Use document probability as fallback
370
+ breakdown[name] = doc_result.ai_probability
371
 
372
  # Use ensemble to combine sentence-level metrics
373
  if sentence_metric_results:
 
376
  domain = self.domain,
377
  )
378
 
379
+ return (ensemble_sentence_result.ai_probability,
380
+ ensemble_sentence_result.human_probability,
381
+ ensemble_sentence_result.mixed_probability,
382
+ ensemble_sentence_result.overall_confidence,
383
+ breakdown)
384
 
385
  except Exception as e:
386
  logger.warning(f"Sentence ensemble failed: {e}")
 
415
  return adjusted_prob
416
 
417
 
418
+ def _create_sentence_metric_result(self, metric_name: str, ai_prob: float, doc_result: MetricResult, sentence_length: int) -> MetricResult:
419
  """
420
  Create sentence-level MetricResult from document-level result
421
  """
422
+ # Derive sentence-level confidence from document confidence and sentence length
423
+ sentence_confidence = self._calculate_sentence_confidence(doc_result.confidence, sentence_length)
424
 
425
  return MetricResult(metric_name = metric_name,
426
  ai_probability = ai_prob,
 
432
  )
433
 
434
 
435
+ def _calculate_sentence_confidence(self, doc_confidence: float, sentence_length: int) -> float:
436
  """
437
+ Calculate confidence for sentence-level analysis, scaled by sentence length
438
  """
439
+ base_reduction = 0.8
440
+ # Scale confidence penalty with sentence length
441
+ length_penalty = max(0.3, min(1.0, sentence_length / 12.0)) # Normalize around 12 words
442
+
443
+ return max(0.1, doc_confidence * base_reduction * length_penalty)
444
 
445
 
446
  def _calculate_weighted_probability(self, metric_results: Dict[str, MetricResult], weights: Dict[str, float], breakdown: Dict[str, float]) -> Tuple[float, float, float, float, Dict[str, float]]:
 
462
  confidences.append(result.confidence)
463
  total_weight += weight
464
 
465
+ if ((not weighted_ai_probs) or (total_weight == 0)):
466
+ return 0.5, 0.5, 0.0, 0.5, breakdown or {}
467
 
468
  ai_prob = sum(weighted_ai_probs) / total_weight
469
  human_prob = sum(weighted_human_probs) / total_weight
 
487
  else:
488
  # Calculate from metrics
489
  return self._calculate_weighted_probability(metric_results, weights, {})
490
+
491
 
492
  def _apply_domain_specific_adjustments(self, sentence: str, ai_prob: float, sentence_length: int) -> float:
493
  """
494
+ Apply domain-specific adjustments to AI probability with limits
495
  """
496
+ original_prob = ai_prob
497
+ adjustments = list()
498
  sentence_lower = sentence.lower()
499
 
500
  # Technical & AI/ML domains
501
+ if (self.domain in [Domain.AI_ML, Domain.SOFTWARE_DEV, Domain.TECHNICAL_DOC, Domain.ENGINEERING, Domain.SCIENCE]):
502
  if self._has_technical_terms(sentence_lower):
503
+ adjustments.append(1.1)
 
504
 
505
  elif self._has_code_like_patterns(sentence):
506
+ adjustments.append(1.15)
507
 
508
+ elif (sentence_length > 35):
509
+ adjustments.append(1.05)
510
 
511
  # Creative & informal domains
512
+ elif (self.domain in [Domain.CREATIVE, Domain.SOCIAL_MEDIA, Domain.BLOG_PERSONAL]):
513
  if self._has_informal_language(sentence_lower):
514
+ adjustments.append(0.7)
 
515
 
516
  elif self._has_emotional_language(sentence):
517
+ adjustments.append(0.8)
518
 
519
  elif (sentence_length < 10):
520
+ adjustments.append(0.8)
521
 
522
  # Academic & formal domains
523
+ elif (self.domain in [Domain.ACADEMIC, Domain.LEGAL, Domain.MEDICAL]):
524
  if self._has_citation_patterns(sentence):
525
+ adjustments.append(0.8)
 
526
 
527
  elif self._has_technical_terms(sentence_lower):
528
+ adjustments.append(1.1)
529
 
530
  elif (sentence_length > 40):
531
+ adjustments.append(1.1)
532
 
533
  # Business & professional domains
534
+ elif (self.domain in [Domain.BUSINESS, Domain.MARKETING, Domain.JOURNALISM]):
535
  if self._has_business_jargon(sentence_lower):
536
+ adjustments.append(1.05)
 
537
 
538
  elif self._has_ambiguous_phrasing(sentence_lower):
539
+ adjustments.append(0.9)
 
540
 
541
  elif (15 <= sentence_length <= 25):
542
+ adjustments.append(0.9)
543
 
544
  # Tutorial & educational domains
545
  elif (self.domain == Domain.TUTORIAL):
546
  if self._has_instructional_language(sentence_lower):
547
+ adjustments.append(0.85)
 
548
 
549
  elif self._has_step_by_step_pattern(sentence):
550
+ adjustments.append(0.8)
551
 
552
  elif self._has_examples(sentence):
553
+ adjustments.append(0.9)
554
 
555
  # General domain - minimal adjustments
556
+ elif (self.domain == Domain.GENERAL):
557
  if self._has_complex_structure(sentence):
558
+ adjustments.append(0.9)
559
 
560
  elif self._has_repetition(sentence):
561
+ adjustments.append(1.1)
562
+
563
+ # Apply adjustments with a limit: keep at most the two strongest
564
+ if adjustments:
565
+ # Sort by impact (farthest from 1.0)
566
+ adjustments.sort(key = lambda x: abs(x - 1.0), reverse = True)
567
+ # Limit to 2 strongest
568
+ strongest_adjustments = adjustments[:2]
569
+
570
+ for adjustment in strongest_adjustments:
571
+ ai_prob *= adjustment
572
+
573
+ # Keep the probability in bounds and cap the net change at ±0.3 (absolute) from the original
574
+ max_change = 0.3
575
+ bounded_prob = max(original_prob - max_change, min(original_prob + max_change, ai_prob))
576
 
577
+ return max(0.0, min(1.0, bounded_prob))
578
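To make the cap concrete (illustrative values): a creative-domain sentence with original ai_prob = 0.9 that trips the informal-language branch gets

    ai_prob      = 0.9 * 0.7                               # = 0.63
    bounded_prob = max(0.9 - 0.3, min(0.9 + 0.3, 0.63))    # = 0.63, within the ±0.3 cap

With the current single-branch elif chains at most one adjustment fires per domain, so the two-adjustment limit and the ±0.3 cap act as safety nets should future branches stack multipliers.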
 
579
 
580
  def _apply_metric_specific_adjustments(self, metric_name: str, sentence: str, base_prob: float, sentence_length: int, thresholds: MetricThresholds) -> float:
 
632
 
633
  def _get_color_for_probability(self, probability: float, is_mixed_content: bool = False, mixed_prob: float = 0.0) -> Tuple[str, str, str]:
634
  """
635
+ Get color class with mixed content support and no threshold gaps
636
  """
637
+ # Handle probability = 1.0 explicitly
638
+ if (probability >= 1.0):
639
+ return "very-high-ai", "#fecaca", "Very likely AI-generated (100%)"
640
+
641
  # Check mixed content first
642
  if (is_mixed_content and (mixed_prob > self.MIXED_THRESHOLD)):
643
  return "mixed-content", "#e9d5ff", f"Mixed AI/Human content ({mixed_prob:.1%} mixed)"
 
647
  if (min_thresh <= probability < max_thresh):
648
  return color_class, color_hex, tooltip
649
 
650
+ # Fallback if no threshold band matched (probability >= 1.0 is already handled above)
651
+ return "very-high-ai", "#fecaca", "Very likely AI-generated"
652
+
653
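The half-open comparison (min_thresh <= probability < max_thresh) only guarantees full coverage if the bands tile [0, 1) without gaps; an illustrative layout (the actual band table lives elsewhere in this class):

    # (min_thresh, max_thresh) -> (color_class, color_hex, tooltip)
    # [0.0, 0.3)  -> "human-like"
    # [0.3, 0.55) -> "uncertain"
    # [0.55, 0.8) -> "likely-ai"
    # [0.8, 1.0)  -> "very-high-ai"

Each band's max is the next band's min, and probability >= 1.0 is handled explicitly above, so every value in [0, 1] maps to exactly one colour.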
 
654
  def _generate_ensemble_tooltip(self, sentence: str, ai_prob: float, human_prob: float, mixed_prob: float, confidence: float, confidence_level: ConfidenceLevel,
655
+ tooltip_base: str, breakdown: Optional[Dict[str, float]] = None, is_mixed_content: bool = False) -> str:
656
  """
657
  Generate enhanced tooltip with ENSEMBLE information
658
  """
 
674
  for metric, prob in list(breakdown.items())[:4]:
675
  tooltip += f"\n• {metric}: {prob:.1%}"
676
 
677
+ tooltip += f"\n\nEnsemble Method: {getattr(self.ensemble, 'primary_method', 'fallback')}"
678
 
679
  return tooltip
680
 
 
789
  Analyze sentence complexity (0 = simple, 1 = complex)
790
  """
791
  words = sentence.split()
792
+ if (len(words) < 5):
793
  return 0.2
794
 
795
  complexity_indicators = ['although', 'because', 'while', 'when', 'if', 'since', 'unless', 'until', 'which', 'that', 'who', 'whom', 'whose', 'and', 'but', 'or', 'yet', 'so', 'however', 'therefore', 'moreover', 'furthermore', 'nevertheless', ',', ';', ':', '—']
 
807
 
808
  clause_indicators = [',', ';', 'and', 'but', 'or', 'because', 'although']
809
  clause_count = sum(1 for indicator in clause_indicators if indicator in sentence.lower())
810
+ score += min(0.2, clause_count * 0.05)
811
 
812
  return min(1.0, score)
813
 
 
841
  for sentence in sentences:
842
  clean_sentence = sentence.strip()
843
 
844
+ if (len(clean_sentence) >= 3):
845
  filtered_sentences.append(clean_sentence)
846
 
847
  return filtered_sentences
 
1172
  total_sentences = len(highlighted_sentences)
1173
 
1174
  # Calculate weighted risk score
1175
+ weighted_risk = 0.0
1176
 
1177
  for sent in highlighted_sentences:
1178
  weight = self.RISK_WEIGHTS.get(sent.color_class, 0.4)
logs/application/app_2025-11-07.log ADDED
 
metrics/multi_perturbation_stability.py CHANGED
@@ -59,6 +59,7 @@ class MultiPerturbationStabilityMetric(BaseMetric):
59
  self.gpt_model, self.gpt_tokenizer = gpt_result
60
  # Move model to appropriate device
61
  self.gpt_model.to(self.device)
 
62
 
63
  else:
64
  logger.error("Failed to load GPT-2 model for MultiPerturbationStability")
@@ -76,9 +77,20 @@ class MultiPerturbationStabilityMetric(BaseMetric):
76
  if (self.mask_tokenizer.pad_token is None):
77
  self.mask_tokenizer.pad_token = self.mask_tokenizer.eos_token or '[PAD]'
78
 
79
  else:
80
  logger.warning("Failed to load mask model, using GPT-2 only")
81
 
82
  self.is_initialized = True
83
 
84
  logger.success("MultiPerturbationStability metric initialized successfully")
@@ -89,12 +101,51 @@ class MultiPerturbationStabilityMetric(BaseMetric):
89
  return False
90
 
91
 
92
  def compute(self, text: str, **kwargs) -> MetricResult:
93
  """
94
  Compute MultiPerturbationStability analysis with FULL DOMAIN THRESHOLD INTEGRATION
95
  """
96
  try:
97
- if ((not text) or (len(text.strip()) < 100)):
98
  return MetricResult(metric_name = self.name,
99
  ai_probability = 0.5,
100
  human_probability = 0.5,
@@ -121,13 +172,16 @@ class MultiPerturbationStabilityMetric(BaseMetric):
121
  )
122
 
123
  # Calculate MultiPerturbationStability features
124
- features = self._calculate_stability_features(text)
125
 
126
  # Calculate raw MultiPerturbationStability score (0-1 scale)
127
- raw_stability_score, confidence = self._analyze_stability_patterns(features)
128
 
129
  # Apply domain-specific thresholds to convert raw score to probabilities
130
- ai_prob, human_prob, mixed_prob = self._apply_domain_thresholds(raw_stability_score, multi_perturbation_stability_thresholds, features)
 
 
 
131
 
132
  # Apply confidence multiplier from domain thresholds
133
  confidence *= multi_perturbation_stability_thresholds.confidence_multiplier
@@ -211,54 +265,75 @@ class MultiPerturbationStabilityMetric(BaseMetric):
211
 
212
  def _calculate_stability_features(self, text: str) -> Dict[str, Any]:
213
  """
214
- Calculate comprehensive MultiPerturbationStability features
215
  """
216
  if not self.gpt_model or not self.gpt_tokenizer:
217
  return self._get_default_features()
218
 
219
  try:
220
  # Preprocess text for better analysis
221
- processed_text = self._preprocess_text_for_analysis(text)
222
 
223
  # Calculate original text likelihood
224
- original_likelihood = self._calculate_likelihood(processed_text)
 
225
 
226
  # Generate perturbations and calculate perturbed likelihoods
227
- perturbations = self._generate_perturbations(processed_text, num_perturbations = 5)
 
 
 
 
228
  perturbed_likelihoods = list()
229
 
230
- for perturbed_text in perturbations:
231
  if (perturbed_text and (perturbed_text != processed_text)):
232
- likelihood = self._calculate_likelihood(perturbed_text)
233
 
234
  if (likelihood > 0):
235
  perturbed_likelihoods.append(likelihood)
 
 
 
236
 
237
  # Calculate stability metrics
238
  if perturbed_likelihoods:
239
- stability_score = self._calculate_stability_score(original_likelihood, perturbed_likelihoods)
240
- curvature_score = self._calculate_curvature_score(original_likelihood, perturbed_likelihoods)
241
- variance_score = np.var(perturbed_likelihoods) if len(perturbed_likelihoods) > 1 else 0.0
 
242
  avg_perturbed_likelihood = np.mean(perturbed_likelihoods)
 
 
243
 
244
  else:
245
- stability_score = 0.5
246
- curvature_score = 0.5
247
- variance_score = 0.1
248
- avg_perturbed_likelihood = original_likelihood
 
 
249
 
250
  # Calculate likelihood ratio
251
- likelihood_ratio = original_likelihood / avg_perturbed_likelihood if avg_perturbed_likelihood > 0 else 1.0
252
 
253
  # Chunk-based analysis for whole-text understanding
254
- chunk_stabilities = self._calculate_chunk_stability(processed_text, chunk_size=150)
255
- stability_variance = np.var(chunk_stabilities) if chunk_stabilities else 0.0
256
- avg_chunk_stability = np.mean(chunk_stabilities) if chunk_stabilities else stability_score
 
 
 
257
 
258
- # Normalize scores to 0-1 range
259
- normalized_stability = min(1.0, max(0.0, stability_score))
260
- normalized_curvature = min(1.0, max(0.0, curvature_score))
261
- normalized_likelihood_ratio = min(2.0, likelihood_ratio) / 2.0 # Normalize to 0-1
262
 
263
  return {"original_likelihood" : round(original_likelihood, 4),
264
  "avg_perturbed_likelihood" : round(avg_perturbed_likelihood, 4),
@@ -281,59 +356,87 @@ class MultiPerturbationStabilityMetric(BaseMetric):
281
 
282
  def _calculate_likelihood(self, text: str) -> float:
283
  """
284
- Calculate log-likelihood of text using GPT-2 with robust error handling
 
285
  """
286
  try:
287
  # Check text length before tokenization
288
  if (len(text.strip()) < 10):
289
- return 0.0
 
 
 
 
290
 
291
- # Configure tokenizer for proper padding
292
- tokenizer = self._configure_tokenizer_padding(self.gpt_tokenizer)
 
293
 
294
  # Tokenize text with proper settings
295
- encodings = tokenizer(text,
296
- return_tensors = 'pt',
297
- truncation = True,
298
- max_length = 512,
299
- padding = True,
300
- return_attention_mask = True,
301
- )
302
 
303
  input_ids = encodings.input_ids.to(self.device)
304
  attention_mask = encodings.attention_mask.to(self.device)
305
 
306
  # Minimum tokens for meaningful analysis
307
- if ((input_ids.numel() == 0) or (input_ids.size(1) < 5)):
308
- return 0.0
309
 
310
- # Calculate negative log likelihood
311
  with torch.no_grad():
312
- outputs = self.gpt_model(input_ids,
313
- attention_mask = attention_mask,
314
- labels = input_ids,
315
- )
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
316
 
317
- loss = outputs.loss
 
318
 
319
- # Convert to positive log likelihood (higher = more likely)
320
- log_likelihood = -loss.item()
321
-
322
- # Reasonable range check (typical values are between -10 and 10)
323
- if (abs(log_likelihood) > 100):
324
- logger.warning(f"Extreme likelihood value detected: {log_likelihood}")
325
- return 0.0
326
 
327
- return log_likelihood
328
 
329
  except Exception as e:
330
  logger.warning(f"Likelihood calculation failed: {repr(e)}")
331
- return 0.0
332
 
333
 
334
  def _generate_perturbations(self, text: str, num_perturbations: int = 5) -> List[str]:
335
  """
336
- Generate perturbed versions of the text with robust error handling
 
 
 
 
337
  """
338
  perturbations = list()
339
 
@@ -383,33 +486,37 @@ class MultiPerturbationStabilityMetric(BaseMetric):
383
  logger.debug(f"Word swapping perturbation failed: {e}")
384
  continue
385
 
386
- # Method 3: RoBERTa-specific masked word replacement
387
  if (self.mask_model and self.mask_tokenizer and (len(words) > 4) and len(perturbations) < num_perturbations):
388
 
389
  try:
390
- roberta_perturbations = self._generate_roberta_masked_perturbations(processed_text,
391
- words,
392
- num_perturbations - len(perturbations))
 
393
  perturbations.extend(roberta_perturbations)
394
 
395
  except Exception as e:
396
- logger.warning(f"RoBERTa masked perturbation failed: {repr(e)}")
397
 
398
  # Method 4: Synonym replacement as fallback
399
  if (len(perturbations) < num_perturbations):
400
  try:
401
- synonym_perturbations = self._generate_synonym_perturbations(processed_text,
402
- words,
403
- num_perturbations - len(perturbations))
 
404
  perturbations.extend(synonym_perturbations)
405
 
406
  except Exception as e:
407
- logger.debug(f"Synonym replacement failed: {e}")
408
 
409
  # Ensure we have at least some perturbations
410
  if not perturbations:
411
  # Fallback: create simple variations
412
- fallback_perturbations = self._generate_fallback_perturbations(processed_text, words)
 
 
413
  perturbations.extend(fallback_perturbations)
414
 
415
  # Remove duplicates and ensure we don't exceed requested number
@@ -423,19 +530,23 @@ class MultiPerturbationStabilityMetric(BaseMetric):
423
 
424
  except Exception as e:
425
  logger.warning(f"Perturbation generation failed: {repr(e)}")
426
- # Return at least the original text as fallback
427
- return [text]
428
 
429
 
430
  def _generate_roberta_masked_perturbations(self, text: str, words: List[str], max_perturbations: int) -> List[str]:
431
  """
432
- Generate perturbations using RoBERTa mask filling
 
433
  """
434
  perturbations = list()
435
 
436
  try:
437
- # RoBERTa uses <mask> token
438
- roberta_mask_token = "<mask>"
 
 
 
 
439
 
440
  # Select words to mask (avoid very short words and punctuation)
441
  candidate_positions = [i for i, word in enumerate(words) if (len(word) > 3) and word.isalpha() and word.lower() not in ['the', 'and', 'but', 'for', 'with']]
@@ -448,7 +559,7 @@ class MultiPerturbationStabilityMetric(BaseMetric):
448
 
449
  # Try multiple mask positions
450
  attempts = min(max_perturbations * 2, len(candidate_positions))
451
- positions_to_try = np.random.choice(candidate_positions, min(attempts, len(candidate_positions)), replace=False)
452
 
453
  for pos in positions_to_try:
454
  if (len(perturbations) >= max_perturbations):
@@ -461,15 +572,15 @@ class MultiPerturbationStabilityMetric(BaseMetric):
461
  masked_words[pos] = roberta_mask_token
462
  masked_text = ' '.join(masked_words)
463
 
464
- # RoBERTa works better with proper sentence structure
465
  if not masked_text.endswith(('.', '!', '?')):
466
  masked_text += '.'
467
 
468
- # Tokenize with RoBERTa-specific settings
469
  inputs = self.mask_tokenizer(masked_text,
470
  return_tensors = "pt",
471
  truncation = True,
472
- max_length = min(128, self.mask_tokenizer.model_max_length), # Conservative length
473
  padding = True,
474
  )
475
 
@@ -508,15 +619,14 @@ class MultiPerturbationStabilityMetric(BaseMetric):
508
 
509
  if (self._is_valid_perturbation(new_text, text)):
510
  perturbations.append(new_text)
511
- # Use first valid prediction
512
- break
513
 
514
  except Exception as e:
515
- logger.debug(f"RoBERTa mask filling failed for position {pos}: {e}")
516
  continue
517
 
518
  except Exception as e:
519
- logger.warning(f"RoBERTa masked perturbations failed: {e}")
520
 
521
  return perturbations
522
 
@@ -559,7 +669,7 @@ class MultiPerturbationStabilityMetric(BaseMetric):
559
  perturbations.append(new_text)
560
 
561
  except Exception as e:
562
- logger.debug(f"Synonym replacement failed: {e}")
563
 
564
  return perturbations
565
 
@@ -585,41 +695,72 @@ class MultiPerturbationStabilityMetric(BaseMetric):
585
  perturbations.append(text.capitalize())
586
 
587
  except Exception as e:
588
- logger.debug(f"Fallback perturbation failed: {e}")
589
 
590
  return [p for p in perturbations if p and p != text][:3]
591
 
592
 
593
  def _calculate_stability_score(self, original_likelihood: float, perturbed_likelihoods: List[float]) -> float:
594
  """
595
- Calculate text stability score under perturbations : AI text tends to be less stable (larger likelihood drops)
596
  """
597
  if ((not perturbed_likelihoods) or (original_likelihood <= 0)):
598
- return 0.5
 
599
 
600
- # Calculate average likelihood drop
601
- likelihood_drops = [(original_likelihood - pl) / original_likelihood for pl in perturbed_likelihoods]
602
- avg_drop = np.mean(likelihood_drops) if likelihood_drops else 0.0
603
 
604
- # Higher drop = less stable = more AI-like : Normalize to 0-1 scale (assume max drop of 50%)
605
- stability_score = min(1.0, avg_drop / 0.5)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
606
 
607
- return stability_score
 
608
 
609
 
610
  def _calculate_curvature_score(self, original_likelihood: float, perturbed_likelihoods: List[float]) -> float:
611
  """
612
- Calculate likelihood curvature score : AI text often has different curvature properties
613
  """
614
  if ((not perturbed_likelihoods) or (original_likelihood <= 0)):
615
- return 0.5
616
 
617
  # Calculate variance of likelihood changes
618
  likelihood_changes = [abs(original_likelihood - pl) for pl in perturbed_likelihoods]
619
- change_variance = np.var(likelihood_changes) if len(likelihood_changes) > 1 else 0.0
620
 
621
- # Higher variance = more curvature = potentially more AI-like : Normalize based on typical variance ranges
622
- curvature_score = min(1.0, change_variance * 10.0) # Adjust scaling factor as needed
 
623
 
624
  return curvature_score
625
 
@@ -637,7 +778,7 @@ class MultiPerturbationStabilityMetric(BaseMetric):
637
 
638
  if (len(chunk) > 50):
639
  try:
640
- chunk_likelihood = self._calculate_likelihood(chunk)
641
 
642
  if (chunk_likelihood > 0):
643
  # Generate a simple perturbation for this chunk
@@ -649,11 +790,12 @@ class MultiPerturbationStabilityMetric(BaseMetric):
649
  indices_to_keep = np.random.choice(len(chunk_words), len(chunk_words) - delete_count, replace=False)
650
  perturbed_chunk = ' '.join([chunk_words[i] for i in sorted(indices_to_keep)])
651
 
652
- perturbed_likelihood = self._calculate_likelihood(perturbed_chunk)
653
 
654
  if (perturbed_likelihood > 0):
655
  stability = (chunk_likelihood - perturbed_likelihood) / chunk_likelihood
656
  stabilities.append(min(1.0, max(0.0, stability)))
 
657
  except Exception:
658
  continue
659
 
@@ -662,7 +804,7 @@ class MultiPerturbationStabilityMetric(BaseMetric):
662
 
663
  def _analyze_stability_patterns(self, features: Dict[str, Any]) -> tuple:
664
  """
665
- Analyze MultiPerturbationStability patterns to determine RAW MultiPerturbationStability score (0-1 scale) : Higher score = more AI-like
666
  """
667
  # Check feature validity first
668
  required_features = ['stability_score', 'curvature_score', 'normalized_likelihood_ratio', 'stability_variance', 'perturbation_variance']
@@ -675,61 +817,76 @@ class MultiPerturbationStabilityMetric(BaseMetric):
675
 
676
 
677
  # Initialize ai_indicator list
678
- ai_indicators = list()
 
679
 
680
  # High stability score suggests AI (larger likelihood drops)
681
- if (features['stability_score'] > 0.6):
682
- ai_indicators.append(0.8)
683
-
684
- elif (features['stability_score'] > 0.3):
685
- ai_indicators.append(0.5)
 
 
 
 
686
 
687
  else:
688
- ai_indicators.append(0.2)
689
 
690
  # High curvature score suggests AI
691
- if (features['curvature_score'] > 0.7):
692
- ai_indicators.append(0.7)
693
-
694
- elif (features['curvature_score'] > 0.4):
695
- ai_indicators.append(0.4)
696
-
 
 
 
 
697
  else:
698
- ai_indicators.append(0.2)
699
 
700
  # High likelihood ratio suggests AI (original much more likely than perturbations)
701
- if (features['normalized_likelihood_ratio'] > 0.8):
702
- ai_indicators.append(0.9)
 
703
 
704
- elif (features['normalized_likelihood_ratio'] > 0.6):
705
- ai_indicators.append(0.6)
706
-
707
  else:
708
- ai_indicators.append(0.3)
709
 
710
  # Low stability variance suggests AI (consistent across chunks)
711
- if (features['stability_variance'] < 0.05):
712
- ai_indicators.append(0.7)
713
-
714
- elif (features['stability_variance'] < 0.1):
715
- ai_indicators.append(0.4)
716
-
717
- else:
718
- ai_indicators.append(0.2)
719
 
720
- # High perturbation variance suggests AI
721
- if (features['perturbation_variance'] > 0.1):
722
- ai_indicators.append(0.6)
723
-
724
- elif (features['perturbation_variance'] > 0.05):
725
- ai_indicators.append(0.4)
726
 
727
  else:
728
- ai_indicators.append(0.2)
729
 
730
  # Calculate raw score and confidence
731
- raw_score = np.mean(ai_indicators) if ai_indicators else 0.5
732
- confidence = 1.0 - (np.std(ai_indicators) / 0.5) if ai_indicators else 0.5
 
733
  confidence = max(0.1, min(0.9, confidence))
734
 
735
  return raw_score, confidence
@@ -770,16 +927,16 @@ class MultiPerturbationStabilityMetric(BaseMetric):
770
 
771
  def _get_default_features(self) -> Dict[str, Any]:
772
  """
773
- Return default features when analysis is not possible
774
  """
775
  return {"original_likelihood" : 2.0,
776
  "avg_perturbed_likelihood" : 1.8,
777
  "likelihood_ratio" : 1.1,
778
  "normalized_likelihood_ratio" : 0.55,
779
- "stability_score" : 0.5,
780
- "curvature_score" : 0.5,
781
  "perturbation_variance" : 0.05,
782
- "avg_chunk_stability" : 0.5,
783
  "stability_variance" : 0.1,
784
  "num_perturbations" : 0,
785
  "num_valid_perturbations" : 0,
@@ -814,14 +971,14 @@ class MultiPerturbationStabilityMetric(BaseMetric):
814
  # Normalize whitespace
815
  text = ' '.join(text.split())
816
 
817
- # RoBERTa works better with proper punctuation
818
  if not text.endswith(('.', '!', '?')):
819
  text += '.'
820
 
821
  # Truncate to safe length
822
  if (len(text) > 1000):
823
  sentences = text.split('. ')
824
- if len(sentences) > 1:
825
  # Keep first few sentences
826
  text = '. '.join(sentences[:3]) + '.'
827
 
@@ -831,50 +988,54 @@ class MultiPerturbationStabilityMetric(BaseMetric):
831
  return text
832
 
833
 
834
- def _configure_tokenizer_padding(self, tokenizer) -> Any:
835
- """
836
- Configure tokenizer for proper padding
837
- """
838
- if tokenizer.pad_token is None:
839
- if tokenizer.eos_token is not None:
840
- tokenizer.pad_token = tokenizer.eos_token
841
-
842
- else:
843
- tokenizer.add_special_tokens({'pad_token': '[PAD]'})
844
-
845
- tokenizer.padding_side = "left"
846
-
847
- return tokenizer
848
-
849
-
850
  def _clean_roberta_token(self, token: str) -> str:
851
  """
852
- Clean tokens from RoBERTa tokenizer
853
  """
854
  if not token:
855
  return ""
856
 
857
- # Remove RoBERTa-specific artifacts
858
  token = token.replace('Ġ', ' ') # RoBERTa space marker
859
  token = token.replace('</s>', '')
860
  token = token.replace('<s>', '')
861
  token = token.replace('<pad>', '')
 
862
 
863
- # Remove leading/trailing whitespace and punctuation
864
- token = token.strip(' .,!?;:"\'')
865
 
866
- return token
 
867
 
868
 
869
  def _is_valid_perturbation(self, perturbed_text: str, original_text: str) -> bool:
870
  """
871
- Check if a perturbation is valid
872
  """
873
- # Not too short
874
- return (perturbed_text and
875
- len(perturbed_text.strip()) > 10 and
876
- perturbed_text != original_text and
877
- len(perturbed_text) > len(original_text) * 0.5)
 
 
879
 
880
  def cleanup(self):
 
59
  self.gpt_model, self.gpt_tokenizer = gpt_result
60
  # Move model to appropriate device
61
  self.gpt_model.to(self.device)
62
+ logger.success("✓ GPT-2 model loaded for MultiPerturbationStability")
63
 
64
  else:
65
  logger.error("Failed to load GPT-2 model for MultiPerturbationStability")
 
77
  if (self.mask_tokenizer.pad_token is None):
78
  self.mask_tokenizer.pad_token = self.mask_tokenizer.eos_token or '[PAD]'
79
 
80
+ # Ensure tokenizer has mask token
81
+ if not hasattr(self.mask_tokenizer, 'mask_token') or self.mask_tokenizer.mask_token is None:
82
+ self.mask_tokenizer.mask_token = "<mask>"
83
+
84
+ logger.success("✓ DistilRoBERTa model loaded for MultiPerturbationStability")
85
+
86
  else:
87
  logger.warning("Failed to load mask model, using GPT-2 only")
88
 
89
+ # Verify model loading
90
+ if not self._verify_model_loading():
91
+ logger.error("Model verification failed")
92
+ return False
93
+
94
  self.is_initialized = True
95
 
96
  logger.success("MultiPerturbationStability metric initialized successfully")
 
101
  return False
102
 
103
 
104
+ def _verify_model_loading(self) -> bool:
105
+ """
106
+ Verify that models are properly loaded and working
107
+ """
108
+ try:
109
+ test_text = "This is a test sentence for model verification."
110
+
111
+ # Test GPT-2 model
112
+ if self.gpt_model and self.gpt_tokenizer:
113
+ gpt_likelihood = self._calculate_likelihood(text = test_text)
114
+ logger.info(f"GPT-2 test - Likelihood: {gpt_likelihood:.4f}")
115
+
116
+ else:
117
+ logger.error("GPT-2 model not loaded")
118
+ return False
119
+
120
+ # Test DistilRoBERTa model if available
121
+ if self.mask_model and self.mask_tokenizer:
122
+ # Test mask token
123
+ if hasattr(self.mask_tokenizer, 'mask_token') and self.mask_tokenizer.mask_token:
124
+ logger.info(f"DistilRoBERTa mask token: '{self.mask_tokenizer.mask_token}'")
125
+
126
+ # Test basic tokenization
127
+ inputs = self.mask_tokenizer(test_text, return_tensors = "pt")
128
+ logger.info(f"DistilRoBERTa tokenization test - Input shape: {inputs['input_ids'].shape}")
129
+
130
+ else:
131
+ logger.warning("DistilRoBERTa mask token not available")
132
+
133
+ else:
134
+ logger.warning("DistilRoBERTa model not loaded")
135
+
136
+ return True
137
+
138
+ except Exception as e:
139
+ logger.error(f"Model verification failed: {e}")
140
+ return False
141
+
142
+
143
  def compute(self, text: str, **kwargs) -> MetricResult:
144
  """
145
  Compute MultiPerturbationStability analysis with FULL DOMAIN THRESHOLD INTEGRATION
146
  """
147
  try:
148
+ if ((not text) or (len(text.strip()) < 50)):
149
  return MetricResult(metric_name = self.name,
150
  ai_probability = 0.5,
151
  human_probability = 0.5,
 
172
  )
173
 
174
  # Calculate MultiPerturbationStability features
175
+ features = self._calculate_stability_features(text = text)
176
 
177
  # Calculate raw MultiPerturbationStability score (0-1 scale)
178
+ raw_stability_score, confidence = self._analyze_stability_patterns(features = features)
179
 
180
  # Apply domain-specific thresholds to convert raw score to probabilities
181
+ ai_prob, human_prob, mixed_prob = self._apply_domain_thresholds(raw_score = raw_stability_score,
182
+ thresholds = multi_perturbation_stability_thresholds,
183
+ features = features,
184
+ )
185
 
186
  # Apply confidence multiplier from domain thresholds
187
  confidence *= multi_perturbation_stability_thresholds.confidence_multiplier
 
265
 
266
  def _calculate_stability_features(self, text: str) -> Dict[str, Any]:
267
  """
268
+ Calculate comprehensive MultiPerturbationStability features with diagnostic logging
269
  """
270
  if not self.gpt_model or not self.gpt_tokenizer:
271
  return self._get_default_features()
272
 
273
  try:
274
  # Preprocess text for better analysis
275
+ processed_text = self._preprocess_text_for_analysis(text = text)
276
 
277
  # Calculate original text likelihood
278
+ original_likelihood = self._calculate_likelihood(text = processed_text)
279
+ logger.debug(f"Original likelihood: {original_likelihood:.4f}")
280
 
281
  # Generate perturbations and calculate perturbed likelihoods
282
+ perturbations = self._generate_perturbations(text = processed_text,
283
+ num_perturbations = 10,
284
+ )
285
+ logger.debug(f"Generated {len(perturbations)} perturbations")
286
+
287
  perturbed_likelihoods = list()
288
 
289
+ for idx, perturbed_text in enumerate(perturbations):
290
  if (perturbed_text and (perturbed_text != processed_text)):
291
+ likelihood = self._calculate_likelihood(text = perturbed_text)
292
 
293
  if (likelihood > 0):
294
  perturbed_likelihoods.append(likelihood)
295
+ logger.debug(f"Perturbation {idx}: likelihood={likelihood:.4f}")
296
+
297
+ logger.info(f"Valid perturbations: {len(perturbed_likelihoods)}/{len(perturbations)}")
298
 
299
  # Calculate stability metrics
300
  if perturbed_likelihoods:
301
+ stability_score = self._calculate_stability_score(original_likelihood = original_likelihood,
302
+ perturbed_likelihoods = perturbed_likelihoods,
303
+ )
304
+
305
+ curvature_score = self._calculate_curvature_score(original_likelihood = original_likelihood,
306
+ perturbed_likelihoods = perturbed_likelihoods,
307
+ )
308
+
309
+ variance_score = np.var(perturbed_likelihoods) if (len(perturbed_likelihoods) > 1) else 0.0
310
  avg_perturbed_likelihood = np.mean(perturbed_likelihoods)
311
+
312
+ logger.info(f"Stability: {stability_score:.3f}, Curvature: {curvature_score:.3f}")
313
 
314
  else:
315
+ # Use meaningful defaults when perturbations fail
316
+ stability_score = 0.3 # Assume more human-like when no perturbations work
317
+ curvature_score = 0.3
318
+ variance_score = 0.05
319
+ avg_perturbed_likelihood = original_likelihood * 0.9 # Assume some drop
320
+ logger.warning("No valid perturbations, using fallback values")
321
 
322
  # Calculate likelihood ratio
323
+ likelihood_ratio = original_likelihood / avg_perturbed_likelihood if avg_perturbed_likelihood > 0 else 1.0
324
 
325
  # Chunk-based analysis for whole-text understanding
326
+ chunk_stabilities = self._calculate_chunk_stability(text = processed_text,
327
+ chunk_size = 150,
328
+ )
329
+
330
+ stability_variance = np.var(chunk_stabilities) if chunk_stabilities else 0.1
331
+ avg_chunk_stability = np.mean(chunk_stabilities) if chunk_stabilities else stability_score
332
 
333
+ # Better normalization to prevent extreme values
334
+ normalized_stability = min(1.0, max(0.0, stability_score))
335
+ normalized_curvature = min(1.0, max(0.0, curvature_score))
336
+ normalized_likelihood_ratio = min(3.0, max(0.33, likelihood_ratio)) / 3.0
337
 
338
  return {"original_likelihood" : round(original_likelihood, 4),
339
  "avg_perturbed_likelihood" : round(avg_perturbed_likelihood, 4),
 
356
 
357
  def _calculate_likelihood(self, text: str) -> float:
358
  """
359
+ Calculate per-token log-likelihood from token probabilities
360
+ Inspired by DetectGPT's likelihood calculation approach
361
  """
362
  try:
363
  # Check text length before tokenization
364
  if (len(text.strip()) < 10):
365
+ return 2.0 # Return reasonable baseline
366
+
367
+ if not self.gpt_model or not self.gpt_tokenizer:
368
+ logger.warning("GPT model not available for likelihood calculation")
369
+ return 2.0
370
 
371
+ # Ensure tokenizer has pad token
372
+ if self.gpt_tokenizer.pad_token is None:
373
+ self.gpt_tokenizer.pad_token = self.gpt_tokenizer.eos_token
374
 
375
  # Tokenize text with proper settings
376
+ encodings = self.gpt_tokenizer(text,
377
+ return_tensors = 'pt',
378
+ truncation = True,
379
+ max_length = 256,
380
+ padding = True,
381
+ return_attention_mask = True,
382
+ )
383
 
384
  input_ids = encodings.input_ids.to(self.device)
385
  attention_mask = encodings.attention_mask.to(self.device)
386
 
387
  # Minimum tokens for meaningful analysis
388
+ if ((input_ids.numel() == 0) or (input_ids.size(1) < 3)):
389
+ return 2.0
390
 
391
+ # Calculate proper log-likelihood using token probabilities
392
  with torch.no_grad():
393
+ outputs = self.gpt_model(input_ids,
394
+ attention_mask = attention_mask,
395
+ )
396
+
397
+ logits = outputs.logits
398
+
399
+ # Calculate log probabilities for each token
400
+ log_probs = torch.nn.functional.log_softmax(logits, dim = -1)
401
+
402
+ # Get the log probability of each actual token
403
+ log_likelihood = 0.0
404
+ token_count = 0
405
+
406
+ for i in range(input_ids.size(1) - 1):
407
+ # Only consider non-padding tokens
408
+ if (attention_mask[0, i] == 1):
409
+ token_id = input_ids[0, i + 1] # Next token prediction
410
+ log_prob = log_probs[0, i, token_id]
411
+ log_likelihood += log_prob.item()
412
+ token_count += 1
413
+
414
+ # Normalize by token count to get average log likelihood per token
415
+ if (token_count > 0):
416
+ avg_log_likelihood = log_likelihood / token_count
417
 
418
+ else:
419
+ avg_log_likelihood = 0.0
420
 
421
+ # Convert to positive scale and normalize
422
+ # Typical GPT-2 log probabilities range from ~-10 to ~-2
423
+ # Note: this is the clamped average NLL per token, so higher values mean less predictable text
424
+ normalized_likelihood = max(0.5, min(10.0, -avg_log_likelihood))
 
 
 
425
 
426
+ return normalized_likelihood
427
 
428
  except Exception as e:
429
  logger.warning(f"Likelihood calculation failed: {repr(e)}")
430
+ return 2.0 # Return reasonable baseline on error
431
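The per-token loop can be cross-checked against a vectorized equivalent; a minimal sketch, reusing the log_probs, input_ids, and attention_mask tensors already computed in this method:

    # Gather the log-prob of each observed next token, mask padding, then average.
    shift_log_probs    = log_probs[:, :-1, :]                  # predictions for positions 1..n-1
    shift_labels       = input_ids[:, 1:]                      # tokens actually observed
    token_log_probs    = shift_log_probs.gather(-1, shift_labels.unsqueeze(-1)).squeeze(-1)
    mask               = attention_mask[:, :-1].float()
    avg_log_likelihood = ((token_log_probs * mask).sum() / mask.sum().clamp(min=1)).item()

Both forms compute the same average next-token log-probability; the vectorized form is simply faster on long inputs.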
 
432
 
433
  def _generate_perturbations(self, text: str, num_perturbations: int = 5) -> List[str]:
434
  """
435
+ Generate perturbed versions of the text using multiple techniques:
436
+ 1. Word deletion (simple but effective)
437
+ 2. Word swapping (preserve meaning)
438
+ 3. DistilRoBERTa masked prediction (DetectGPT-inspired, using lighter model than T5)
439
+ 4. Synonym replacement (fallback)
440
  """
441
  perturbations = list()
442
 
 
486
  logger.debug(f"Word swapping perturbation failed: {e}")
487
  continue
488
 
489
+ # Method 3: DistilRoBERTa-based masked word replacement (DetectGPT-inspired)
490
  if (self.mask_model and self.mask_tokenizer and (len(words) > 4) and len(perturbations) < num_perturbations):
491
 
492
  try:
493
+ roberta_perturbations = self._generate_roberta_masked_perturbations(text = processed_text,
494
+ words = words,
495
+ max_perturbations = num_perturbations - len(perturbations),
496
+ )
497
  perturbations.extend(roberta_perturbations)
498
 
499
  except Exception as e:
500
+ logger.warning(f"DistilRoBERTa masked perturbation failed: {repr(e)}")
501
 
502
  # Method 4: Synonym replacement as fallback
503
  if (len(perturbations) < num_perturbations):
504
  try:
505
+ synonym_perturbations = self._generate_synonym_perturbations(text = processed_text,
506
+ words = words,
507
+ max_perturbations = num_perturbations - len(perturbations),
508
+ )
509
  perturbations.extend(synonym_perturbations)
510
 
511
  except Exception as e:
512
+ logger.debug(f"Synonym replacement failed: {repr(e)}")
513
 
514
  # Ensure we have at least some perturbations
515
  if not perturbations:
516
  # Fallback: create simple variations
517
+ fallback_perturbations = self._generate_fallback_perturbations(text = processed_text,
518
+ words = words,
519
+ )
520
  perturbations.extend(fallback_perturbations)
521
 
522
  # Remove duplicates and ensure we don't exceed requested number
 
530
 
531
  except Exception as e:
532
  logger.warning(f"Perturbation generation failed: {repr(e)}")
533
+ return [text] # Return at least the original text as fallback
 
534
 
535
 
536
  def _generate_roberta_masked_perturbations(self, text: str, words: List[str], max_perturbations: int) -> List[str]:
537
  """
538
+ Generate perturbations using DistilRoBERTa mask filling
539
+ This is inspired by DetectGPT but uses a lighter model (DistilRoBERTa instead of T5)
540
  """
541
  perturbations = list()
542
 
543
  try:
544
+ # Use the proper DistilRoBERTa mask token from tokenizer
545
+ if hasattr(self.mask_tokenizer, 'mask_token') and self.mask_tokenizer.mask_token:
546
+ roberta_mask_token = self.mask_tokenizer.mask_token
547
+
548
+ else:
549
+ roberta_mask_token = "<mask>" # Fallback
550
 
551
  # Select words to mask (avoid very short words and punctuation)
552
  candidate_positions = [i for i, word in enumerate(words) if (len(word) > 3) and word.isalpha() and word.lower() not in ['the', 'and', 'but', 'for', 'with']]
 
559
 
560
  # Try multiple mask positions
561
  attempts = min(max_perturbations * 2, len(candidate_positions))
562
+ positions_to_try = np.random.choice(candidate_positions, min(attempts, len(candidate_positions)), replace = False)
563
 
564
  for pos in positions_to_try:
565
  if (len(perturbations) >= max_perturbations):
 
572
  masked_words[pos] = roberta_mask_token
573
  masked_text = ' '.join(masked_words)
574
 
575
+ # DistilRoBERTa works better with proper sentence structure
576
  if not masked_text.endswith(('.', '!', '?')):
577
  masked_text += '.'
578
 
579
+ # Tokenize with DistilRoBERTa-specific settings
580
  inputs = self.mask_tokenizer(masked_text,
581
  return_tensors = "pt",
582
  truncation = True,
583
+ max_length = min(128, self.mask_tokenizer.model_max_length),
584
  padding = True,
585
  )
586
 
 
619
 
620
  if (self._is_valid_perturbation(new_text, text)):
621
  perturbations.append(new_text)
622
+ break # Use first valid prediction
 
623
 
624
  except Exception as e:
625
+ logger.debug(f"DistilRoBERTa mask filling failed for position {pos}: {e}")
626
  continue
627
 
628
  except Exception as e:
629
+ logger.warning(f"DistilRoBERTa masked perturbations failed: {e}")
630
 
631
  return perturbations
632
 
 
669
  perturbations.append(new_text)
670
 
671
  except Exception as e:
672
+ logger.debug(f"Synonym replacement failed: {repr(e)}")
673
 
674
  return perturbations
675
 
 
695
  perturbations.append(text.capitalize())
696
 
697
  except Exception as e:
698
+ logger.debug(f"Fallback perturbation failed: {repr(e)}")
699
 
700
  return [p for p in perturbations if p and p != text][:3]
701
 
702
 
703
  def _calculate_stability_score(self, original_likelihood: float, perturbed_likelihoods: List[float]) -> float:
704
  """
705
+ Calculate text stability score with improved normalization : AI text typically shows larger likelihood drops under perturbation than human text, which yields a higher (more AI-like) score
706
  """
707
  if ((not perturbed_likelihoods) or (original_likelihood <= 0)):
708
+ # Assume more human-like when no data
709
+ return 0.3
710
 
711
+ # Calculate relative likelihood drops
712
+ relative_drops = list()
 
713
 
714
+ for pl in perturbed_likelihoods:
715
+ if (pl > 0):
716
+ # Use relative drop to handle scale differences
717
+ relative_drop = (original_likelihood - pl) / original_likelihood
718
+
719
+ # Clamp to [0, 1]
720
+ relative_drops.append(max(0.0, min(1.0, relative_drop)))
721
+
722
+ if not relative_drops:
723
+ return 0.3
724
+
725
+ avg_relative_drop = np.mean(relative_drops)
726
+
727
+ # Normalization based on empirical observations : AI text typically shows 20-60% drops, human text shows 10-30% drops
728
+ if (avg_relative_drop > 0.5):
729
+ # Strong AI indicator
730
+ stability_score = 0.9
731
 
732
+ elif (avg_relative_drop > 0.3):
733
+ # 0.6 to 0.9
734
+ stability_score = 0.6 + (avg_relative_drop - 0.3) * 1.5
735
+
736
+ elif (avg_relative_drop > 0.15):
737
+ # 0.3 to 0.6
738
+ stability_score = 0.3 + (avg_relative_drop - 0.15) * 2.0
739
+
740
+ else:
741
+ # 0.0 to 0.3
742
+ stability_score = avg_relative_drop * 2.0
743
+
744
+ return min(1.0, max(0.0, stability_score))
745
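The piecewise mapping is continuous at the band edges (0.15 -> 0.3, 0.3 -> 0.6, 0.5 -> 0.9); a few illustrative points:

    # avg_relative_drop -> stability_score
    #   0.10 -> 0.10 * 2.0                = 0.20   (human-typical)
    #   0.20 -> 0.3 + (0.20 - 0.15) * 2.0 = 0.40
    #   0.40 -> 0.6 + (0.40 - 0.3) * 1.5  = 0.75
    #   0.55 -> 0.90                               (strong AI indicator)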
 
746
 
747
  def _calculate_curvature_score(self, original_likelihood: float, perturbed_likelihoods: List[float]) -> float:
748
  """
749
+ Calculate likelihood curvature score with better scaling : Measures how "curved" the likelihood surface is around the text
750
  """
751
  if ((not perturbed_likelihoods) or (original_likelihood <= 0)):
752
+ return 0.3
753
 
754
  # Calculate variance of likelihood changes
755
  likelihood_changes = [abs(original_likelihood - pl) for pl in perturbed_likelihoods]
 
756
 
757
+ if (len(likelihood_changes) < 2):
758
+ return 0.3
759
+
760
+ change_variance = np.var(likelihood_changes)
761
+
762
+ # Typical change variance falls around 0.1-0.5, so scale by 3.0 and clamp to [0, 1]
763
+ curvature_score = min(1.0, change_variance * 3.0)
764
 
765
  return curvature_score
766
 
 
778
 
779
  if (len(chunk) > 50):
780
  try:
781
+ chunk_likelihood = self._calculate_likelihood(text = chunk)
782
 
783
  if (chunk_likelihood > 0):
784
  # Generate a simple perturbation for this chunk
 
790
  indices_to_keep = np.random.choice(len(chunk_words), len(chunk_words) - delete_count, replace=False)
791
  perturbed_chunk = ' '.join([chunk_words[i] for i in sorted(indices_to_keep)])
792
 
793
+ perturbed_likelihood = self._calculate_likelihood(text = perturbed_chunk)
794
 
795
  if (perturbed_likelihood > 0):
796
  stability = (chunk_likelihood - perturbed_likelihood) / chunk_likelihood
797
  stabilities.append(min(1.0, max(0.0, stability)))
798
+
799
  except Exception:
800
  continue
801
 
 
804
 
805
  def _analyze_stability_patterns(self, features: Dict[str, Any]) -> tuple:
806
  """
807
+ Analyze MultiPerturbationStability patterns with better feature weighting
808
  """
809
  # Check feature validity first
810
  required_features = ['stability_score', 'curvature_score', 'normalized_likelihood_ratio', 'stability_variance', 'perturbation_variance']
 
817
 
818
 
819
  # Initialize ai_indicator list
820
+ ai_indicators = list()
821
+
822
+ # Better weighting based on feature reliability
823
+ stability_weight = 0.3
824
+ curvature_weight = 0.25
825
+ ratio_weight = 0.25
826
+ variance_weight = 0.2
827
 
828
  # High stability score suggests AI (larger likelihood drops)
829
+ stability = features['stability_score']
830
+ if (stability > 0.7):
831
+ ai_indicators.append(0.9 * stability_weight)
832
+
833
+ elif (stability > 0.5):
834
+ ai_indicators.append(0.7 * stability_weight)
835
+
836
+ elif (stability > 0.3):
837
+ ai_indicators.append(0.5 * stability_weight)
838
 
839
  else:
840
+ ai_indicators.append(0.2 * stability_weight)
841
 
842
  # High curvature score suggests AI
843
+ curvature = features['curvature_score']
844
+ if (curvature > 0.7):
845
+ ai_indicators.append(0.8 * curvature_weight)
846
+
847
+ elif (curvature > 0.5):
848
+ ai_indicators.append(0.6 * curvature_weight)
849
+
850
+ elif (curvature > 0.3):
851
+ ai_indicators.append(0.4 * curvature_weight)
852
+
853
  else:
854
+ ai_indicators.append(0.2 * curvature_weight)
855
 
856
  # High likelihood ratio suggests AI (original much more likely than perturbations)
857
+ ratio = features['normalized_likelihood_ratio']
858
+ if (ratio > 0.8):
859
+ ai_indicators.append(0.9 * ratio_weight)
860
+
861
+ elif (ratio > 0.6):
862
+ ai_indicators.append(0.7 * ratio_weight)
863
+
864
+ elif (ratio > 0.4):
865
+ ai_indicators.append(0.5 * ratio_weight)
866
 
 
 
 
867
  else:
868
+ ai_indicators.append(0.3 * ratio_weight)
869
 
870
  # Low stability variance suggests AI (consistent across chunks)
871
+ stability_var = features['stability_variance']
872
+ if (stability_var < 0.05):
873
+ ai_indicators.append(0.8 * variance_weight)
 
874
 
875
+ elif (stability_var < 0.1):
876
+ ai_indicators.append(0.5 * variance_weight)
 
 
 
 
877
 
878
  else:
879
+ ai_indicators.append(0.2 * variance_weight)
880
 
881
  # Calculate raw score and confidence
882
+ if ai_indicators:
883
+ raw_score = sum(ai_indicators)
884
+ # Confidence rises as the (unweighted) indicators agree with each other
+ weights = [stability_weight, curvature_weight, ratio_weight, variance_weight]
+ unweighted = [x / weights[i] for i, x in enumerate(ai_indicators)]
+ spread = np.std(unweighted) if len(unweighted) > 1 else 0.5
+ confidence = 0.5 + (0.5 * (1.0 - spread))
885
+
886
+ else:
887
+ raw_score = 0.5
888
+ confidence = 0.3
889
+
890
  confidence = max(0.1, min(0.9, confidence))
891
 
892
  return raw_score, confidence
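A worked example with illustrative feature values: stability = 0.6, curvature = 0.6, ratio = 0.7, stability_variance = 0.08 gives

    raw_score = 0.7*0.30 + 0.6*0.25 + 0.7*0.25 + 0.5*0.20
              = 0.21 + 0.15 + 0.175 + 0.10
              = 0.635

Because the weights sum to 1.0, the weighted indicators sum directly to a raw score on a 0-1 scale (attainable range roughly 0.225 to 0.855 given the band values above).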
 
927
 
928
  def _get_default_features(self) -> Dict[str, Any]:
929
  """
930
+ Return more meaningful default features
931
  """
932
  return {"original_likelihood" : 2.0,
933
  "avg_perturbed_likelihood" : 1.8,
934
  "likelihood_ratio" : 1.1,
935
  "normalized_likelihood_ratio" : 0.55,
936
+ "stability_score" : 0.3,
937
+ "curvature_score" : 0.3,
938
  "perturbation_variance" : 0.05,
939
+ "avg_chunk_stability" : 0.3,
940
  "stability_variance" : 0.1,
941
  "num_perturbations" : 0,
942
  "num_valid_perturbations" : 0,
 
971
  # Normalize whitespace
972
  text = ' '.join(text.split())
973
 
974
+ # DistilRoBERTa works better with proper punctuation
975
  if not text.endswith(('.', '!', '?')):
976
  text += '.'
977
 
978
  # Truncate to safe length
979
  if (len(text) > 1000):
980
  sentences = text.split('. ')
981
+ if (len(sentences) > 1):
982
  # Keep first few sentences
983
  text = '. '.join(sentences[:3]) + '.'
984
 
 
988
  return text
989
 
990
 
991
  def _clean_roberta_token(self, token: str) -> str:
992
  """
993
+ Clean tokens from DistilRoBERTa tokenizer
994
  """
995
  if not token:
996
  return ""
997
 
998
+ # Remove DistilRoBERTa-specific artifacts
999
  token = token.replace('Ġ', ' ') # RoBERTa space marker
1000
  token = token.replace('</s>', '')
1001
  token = token.replace('<s>', '')
1002
  token = token.replace('<pad>', '')
1003
+ token = token.replace('<mask>', '')
1004
 
1005
+ # Remove leading/trailing whitespace
1006
+ token = token.strip()
1007
 
1008
+ # Only remove punctuation if token is ONLY punctuation
1009
+ if token and not token.replace('.', '').replace(',', '').replace('!', '').replace('?', '').strip():
1010
+ return ""
1011
+
1012
+ # Keep the token if it has at least 2 alphanumeric characters
1013
+ if sum(c.isalnum() for c in token) >= 2:
1014
+ return token
1015
+
1016
+ return ""
1017
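Illustrative behaviour of the cleaning rules (hypothetical tokens):

    # 'Ġhouse' -> 'house'   space marker stripped, >= 2 alphanumerics so it is kept
    # '...'    -> ''        punctuation-only token rejected
    # '<mask>' -> ''        special token stripped, nothing left to keep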
 
1018
 
1019
  def _is_valid_perturbation(self, perturbed_text: str, original_text: str) -> bool:
1020
  """
1021
+ Check if a perturbation is valid (lenient validation)
1022
  """
1023
+ if (not perturbed_text or not perturbed_text.strip()):
1024
+ return False
1025
+
1026
+ # Must be different from original
1027
+ if (perturbed_text == original_text):
1028
+ return False
1029
+
1030
+ # Lenient length check
1031
+ if (len(perturbed_text) < len(original_text) * 0.3):
1032
+ return False
1033
+
1034
+ # Must have some actual content
1035
+ if len(perturbed_text.strip()) < 5:
1036
+ return False
1037
+
1038
+ return True
1039
 
1040
 
1041
  def cleanup(self):
models/model_manager.py CHANGED
@@ -21,6 +21,7 @@ from transformers import AutoTokenizer
21
  from transformers import GPT2LMHeadModel
22
  from config.model_config import ModelType
23
  from config.model_config import ModelConfig
 
24
  from transformers import AutoModelForMaskedLM
25
  from config.model_config import MODEL_REGISTRY
26
  from config.model_config import get_model_config
@@ -237,6 +238,12 @@ class ModelManager:
237
  elif (model_config.model_type == ModelType.TRANSFORMER):
238
  model = self._load_transformer(config = model_config)
239
 
240
  elif (model_config.model_type == ModelType.RULE_BASED):
241
  # Check if it's a spaCy model
242
  if model_config.additional_params.get("is_spacy_model", False):
@@ -288,7 +295,13 @@ class ModelManager:
288
  logger.info(f"Loading tokenizer for: {model_name}")
289
 
290
  try:
291
- if (model_config.model_type in [ModelType.GPT, ModelType.CLASSIFIER, ModelType.SEQUENCE_CLASSIFICATION, ModelType.TRANSFORMER]):
 
292
  tokenizer = AutoTokenizer.from_pretrained(pretrained_model_name_or_path = model_config.model_id,
293
  cache_dir = str(self.cache_dir),
294
  )
@@ -339,6 +352,54 @@ class ModelManager:
339
  return (model, tokenizer)
340
 
341
 
342
  def _load_classifier(self, config: ModelConfig) -> Any:
343
  """
344
  Load classification model (for zero-shot, etc.)
@@ -483,7 +544,7 @@ class ModelManager:
483
  logger.info(f"Downloading model: {model_name} ({model_config.model_id})")
484
 
485
  try:
486
- if model_config.model_type == ModelType.SENTENCE_TRANSFORMER:
487
  SentenceTransformer(model_name_or_path = model_config.model_id,
488
  cache_folder = str(self.cache_dir),
489
  )
@@ -506,6 +567,24 @@ class ModelManager:
506
  cache_dir = str(self.cache_dir),
507
  )
508
 
509
  elif (model_config.model_type == ModelType.RULE_BASED):
510
  if model_config.additional_params.get("is_spacy_model", False):
511
  subprocess.run(["python", "-m", "spacy", "download", model_config.model_id], check = True)
 
21
  from transformers import GPT2LMHeadModel
22
  from config.model_config import ModelType
23
  from config.model_config import ModelConfig
24
+ from transformers import AutoModelForCausalLM
25
  from transformers import AutoModelForMaskedLM
26
  from config.model_config import MODEL_REGISTRY
27
  from config.model_config import get_model_config
 
238
  elif (model_config.model_type == ModelType.TRANSFORMER):
239
  model = self._load_transformer(config = model_config)
240
 
241
+ elif (model_config.model_type == ModelType.CAUSAL_LM):
242
+ model = self._load_causal_lm(config = model_config)
243
+
244
+ elif (model_config.model_type == ModelType.MASKED_LM):
245
+ model = self._load_masked_lm(config = model_config)
246
+
247
  elif (model_config.model_type == ModelType.RULE_BASED):
248
  # Check if it's a spaCy model
249
  if model_config.additional_params.get("is_spacy_model", False):
 
295
  logger.info(f"Loading tokenizer for: {model_name}")
296
 
297
  try:
298
+ if (model_config.model_type in [ModelType.GPT,
299
+ ModelType.CLASSIFIER,
300
+ ModelType.SEQUENCE_CLASSIFICATION,
301
+ ModelType.TRANSFORMER,
302
+ ModelType.CAUSAL_LM,
303
+ ModelType.MASKED_LM]):
304
+
305
  tokenizer = AutoTokenizer.from_pretrained(pretrained_model_name_or_path = model_config.model_id,
306
  cache_dir = str(self.cache_dir),
307
  )
 
352
  return (model, tokenizer)
353
 
354
 
355
+ def _load_causal_lm(self, config: ModelConfig) -> tuple:
356
+ """
357
+ Load causal language model (like GPT-2) for text generation
358
+ """
359
+ model = AutoModelForCausalLM.from_pretrained(pretrained_model_name_or_path = config.model_id,
360
+ cache_dir = str(self.cache_dir),
361
+ )
362
+
363
+ tokenizer = AutoTokenizer.from_pretrained(pretrained_model_name_or_path = config.model_id,
364
+ cache_dir = str(self.cache_dir),
365
+ )
366
+
367
+ # Move to device
368
+ model = model.to(self.device)
369
+
370
+ model.eval()
371
+
372
+ # Apply quantization if enabled
373
+ if (settings.USE_QUANTIZATION and config.quantizable):
374
+ model = self._quantize_model(model = model)
375
+
376
+ return (model, tokenizer)
377
+
378
+
379
+ def _load_masked_lm(self, config: ModelConfig) -> tuple:
380
+ """
381
+ Load masked language model (like RoBERTa) for fill-mask tasks
382
+ """
383
+ model = AutoModelForMaskedLM.from_pretrained(pretrained_model_name_or_path = config.model_id,
384
+ cache_dir = str(self.cache_dir),
385
+ )
386
+
387
+ tokenizer = AutoTokenizer.from_pretrained(pretrained_model_name_or_path = config.model_id,
388
+ cache_dir = str(self.cache_dir),
389
+ )
390
+
391
+ # Move to device
392
+ model = model.to(self.device)
393
+
394
+ model.eval()
395
+
396
+ # Apply quantization if enabled
397
+ if (settings.USE_QUANTIZATION and config.quantizable):
398
+ model = self._quantize_model(model = model)
399
+
400
+ return (model, tokenizer)
401
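A minimal usage sketch (model names hypothetical; real entries come from MODEL_REGISTRY via get_model_config):

    # gpt_model, gpt_tokenizer   = manager._load_causal_lm(config = get_model_config("gpt2"))
    # mask_model, mask_tokenizer = manager._load_masked_lm(config = get_model_config("distilroberta-base"))

Both loaders return an eval-mode (model, tokenizer) pair already moved to self.device, with quantization applied when enabled.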
+
402
+
403
  def _load_classifier(self, config: ModelConfig) -> Any:
404
  """
405
  Load classification model (for zero-shot, etc.)
 
544
  logger.info(f"Downloading model: {model_name} ({model_config.model_id})")
545
 
546
  try:
547
+ if (model_config.model_type == ModelType.SENTENCE_TRANSFORMER):
548
  SentenceTransformer(model_name_or_path = model_config.model_id,
549
  cache_folder = str(self.cache_dir),
550
  )
 
567
  cache_dir = str(self.cache_dir),
568
  )
569
 
570
+ elif (model_config.model_type == ModelType.CAUSAL_LM):
571
+ AutoModelForCausalLM.from_pretrained(pretrained_model_name_or_path = model_config.model_id,
572
+ cache_dir = str(self.cache_dir),
573
+ )
574
+
575
+ AutoTokenizer.from_pretrained(pretrained_model_name_or_path = model_config.model_id,
576
+ cache_dir = str(self.cache_dir),
577
+ )
578
+
579
+ elif (model_config.model_type == ModelType.MASKED_LM):
580
+ AutoModelForMaskedLM.from_pretrained(pretrained_model_name_or_path = model_config.model_id,
581
+ cache_dir = str(self.cache_dir),
582
+ )
583
+
584
+ AutoTokenizer.from_pretrained(pretrained_model_name_or_path = model_config.model_id,
585
+ cache_dir = str(self.cache_dir),
586
+ )
587
+
588
  elif (model_config.model_type == ModelType.RULE_BASED):
589
  if model_config.additional_params.get("is_spacy_model", False):
590
  subprocess.run(["python", "-m", "spacy", "download", model_config.model_id], check = True)
reporter/report_generator.py CHANGED
@@ -79,6 +79,9 @@ class ReportGenerator:
79
  --------
80
  { dict } : Dictionary mapping format to filepath
81
  """
 
 
 
82
  # Generate detailed reasoning
83
  reasoning = self.reasoning_generator.generate(ensemble_result = detection_result.ensemble_result,
84
  metric_results = detection_result.metric_results,
@@ -88,7 +91,7 @@ class ReportGenerator:
88
  )
89
 
90
  # Extract detailed metrics from ACTUAL detection results
91
- detailed_metrics = self._extract_detailed_metrics(detection_result)
92
 
93
  # Timestamp for filenames
94
  timestamp = datetime.now().strftime("%Y%m%d_%H%M%S")
@@ -97,7 +100,7 @@ class ReportGenerator:
97
 
98
  # Generate requested formats
99
  if ("json" in formats):
100
- json_path = self._generate_json_report(detection_result = detection_result,
101
  reasoning = reasoning,
102
  detailed_metrics = detailed_metrics,
103
  attribution_result = attribution_result,
@@ -108,7 +111,7 @@ class ReportGenerator:
108
 
109
  if ("pdf" in formats):
110
  try:
111
- pdf_path = self._generate_pdf_report(detection_result = detection_result,
112
  reasoning = reasoning,
113
  detailed_metrics = detailed_metrics,
114
  attribution_result = attribution_result,
@@ -126,26 +129,29 @@ class ReportGenerator:
126
  return generated_files
127
 
128
 
129
- def _extract_detailed_metrics(self, detection_result: DetectionResult) -> List[DetailedMetric]:
130
  """
131
  Extract detailed metrics with sub-metrics from ACTUAL detection result
132
  """
133
  detailed_metrics = list()
134
- metric_results = detection_result.metric_results
135
- ensemble_result = detection_result.ensemble_result
136
 
137
  # Get actual metric weights from ensemble
138
- metric_weights = getattr(ensemble_result, 'metric_weights', {})
139
 
140
  # Extract actual metric data
141
- for metric_name, metric_result in metric_results.items():
142
- if metric_result.error is not None:
 
 
 
143
  continue
144
 
145
  # Get actual probabilities and confidence
146
- ai_prob = metric_result.ai_probability * 100
147
- human_prob = metric_result.human_probability * 100
148
- confidence = metric_result.confidence * 100
149
 
150
  # Determine verdict based on actual probability
151
  if (ai_prob >= 60):
@@ -158,7 +164,9 @@ class ReportGenerator:
158
  verdict = "MIXED (AI + HUMAN)"
159
 
160
  # Get actual weight or use default
161
- weight = metric_weights.get(metric_name, 0.0) * 100
 
 
162
 
163
  # Extract actual detailed metrics from metric result
164
  detailed_metrics_data = self._extract_metric_details(metric_name = metric_name,
@@ -182,22 +190,22 @@ class ReportGenerator:
182
  return detailed_metrics
183
 
184
 
185
- def _extract_metric_details(self, metric_name: str, metric_result) -> Dict[str, float]:
186
  """
187
  Extract detailed sub-metrics from metric result
188
  """
189
  details = dict()
190
 
191
  # Try to get details from metric result
192
- if ((hasattr(metric_result, 'details')) and metric_result.details):
193
- details = metric_result.details.copy()
194
 
195
  # If no details available, provide basic calculated values
196
  if not details:
197
- details = {"ai_probability" : metric_result.ai_probability * 100,
198
- "human_probability" : metric_result.human_probability * 100,
199
- "confidence" : metric_result.confidence * 100,
200
- "score" : getattr(metric_result, 'score', 0.0) * 100,
201
  }
202
 
203
  return details
@@ -218,7 +226,7 @@ class ReportGenerator:
218
  return descriptions.get(metric_name, "Advanced text analysis metric.")
219
 
220
 
221
- def _generate_json_report(self, detection_result: DetectionResult, reasoning: DetailedReasoning, detailed_metrics: List[DetailedMetric],
222
  attribution_result: Optional[AttributionResult], highlighted_sentences: Optional[List] = None, filename: str = None) -> Path:
223
  """
224
  Generate JSON format report with detailed metrics
@@ -251,7 +259,7 @@ class ReportGenerator:
251
  "index" : sent.index,
252
  })
253
 
254
- # Attribution data - use attribution_result
255
  attribution_data = None
256
 
257
  if attribution_result:
@@ -264,30 +272,32 @@ class ReportGenerator:
264
  "metric_contributions": attribution_result.metric_contributions,
265
  }
266
 
267
- # Use ACTUAL detection results with ensemble integration
268
- ensemble_result = detection_result.ensemble_result
 
 
 
269
 
270
  report_data = {"report_metadata" : {"generated_at" : datetime.now().isoformat(),
271
  "version" : "1.0.0",
272
  "format" : "json",
273
  "report_id" : filename.replace('.json', ''),
274
  },
275
- "overall_results" : {"final_verdict" : ensemble_result.final_verdict,
276
- "ai_probability" : round(ensemble_result.ai_probability, 4),
277
- "human_probability" : round(ensemble_result.human_probability, 4),
278
- "mixed_probability" : round(ensemble_result.mixed_probability, 4),
279
- "overall_confidence" : round(ensemble_result.overall_confidence, 4),
280
- "uncertainty_score" : round(ensemble_result.uncertainty_score, 4),
281
- "consensus_level" : round(ensemble_result.consensus_level, 4),
282
- "domain" : detection_result.domain_prediction.primary_domain.value,
283
- "domain_confidence" : round(detection_result.domain_prediction.confidence, 4),
284
- "text_length" : detection_result.processed_text.word_count,
285
- "sentence_count" : detection_result.processed_text.sentence_count,
286
  },
287
  "ensemble_analysis" : {"method_used" : "confidence_calibrated",
288
- "metric_weights" : {name: round(weight, 4) for name, weight in ensemble_result.metric_weights.items()},
289
- "weighted_scores" : {name: round(score, 4) for name, score in ensemble_result.weighted_scores.items()},
290
- "reasoning" : ensemble_result.reasoning,
291
  },
292
  "detailed_metrics" : metrics_data,
293
  "detection_reasoning" : {"summary" : reasoning.summary,
@@ -303,10 +313,10 @@ class ReportGenerator:
303
  },
304
  "highlighted_text" : highlighted_data,
305
  "model_attribution" : attribution_data,
306
- "performance_metrics" : {"total_processing_time" : round(detection_result.processing_time, 3),
307
- "metrics_execution_time" : {name: round(time, 3) for name, time in detection_result.metrics_execution_time.items()},
308
- "warnings" : detection_result.warnings,
309
- "errors" : detection_result.errors,
310
  }
311
  }
312
 
@@ -323,7 +333,7 @@ class ReportGenerator:
323
  return output_path
324
 
325
 
326
- def _generate_pdf_report(self, detection_result: DetectionResult, reasoning: DetailedReasoning, detailed_metrics: List[DetailedMetric],
327
  attribution_result: Optional[AttributionResult], highlighted_sentences: Optional[List] = None, filename: str = None) -> Path:
328
  """
329
  Generate PDF format report with detailed metrics
@@ -378,8 +388,9 @@ class ReportGenerator:
378
  spaceAfter = 8,
379
  )
380
 
381
- # Use detection results with ensemble integration
382
- ensemble_result = detection_result.ensemble_result
 
383
 
384
  # Title and main sections
385
  elements.append(Paragraph("AI Text Detection Analysis Report", title_style))
@@ -388,13 +399,13 @@ class ReportGenerator:
388
 
389
  # Verdict section with ensemble metrics
390
  elements.append(Paragraph("Detection Summary", heading_style))
391
- verdict_data = [['Final Verdict:', ensemble_result.final_verdict],
392
- ['AI Probability:', f"{ensemble_result.ai_probability:.1%}"],
393
- ['Human Probability:', f"{ensemble_result.human_probability:.1%}"],
394
- ['Mixed Probability:', f"{ensemble_result.mixed_probability:.1%}"],
395
- ['Overall Confidence:', f"{ensemble_result.overall_confidence:.1%}"],
396
- ['Uncertainty Score:', f"{ensemble_result.uncertainty_score:.1%}"],
397
- ['Consensus Level:', f"{ensemble_result.consensus_level:.1%}"],
398
  ]
399
 
400
  verdict_table = Table(verdict_data, colWidths=[2*inch, 3*inch])
@@ -410,11 +421,11 @@ class ReportGenerator:
410
 
411
  # Content analysis
412
  elements.append(Paragraph("Content Analysis", heading_style))
413
- content_data = [['Content Domain:', detection_result.domain_prediction.primary_domain.value.title()],
414
- ['Domain Confidence:', f"{detection_result.domain_prediction.confidence:.1%}"],
415
- ['Word Count:', str(detection_result.processed_text.word_count)],
416
- ['Sentence Count:', str(detection_result.processed_text.sentence_count)],
417
- ['Processing Time:', f"{detection_result.processing_time:.2f}s"],
418
  ]
419
 
420
  content_table = Table(content_data, colWidths=[2*inch, 3*inch])
@@ -428,14 +439,16 @@ class ReportGenerator:
428
 
429
  # Ensemble Analysis
430
  elements.append(Paragraph("Ensemble Analysis", heading_style))
431
- elements.append(Paragraph(f"Method: Confidence Calibrated Aggregation", styles['Normal']))
432
  elements.append(Spacer(1, 0.1*inch))
433
 
434
  # Metric weights table
435
- if hasattr(ensemble_result, 'metric_weights') and ensemble_result.metric_weights:
 
436
  elements.append(Paragraph("Metric Weights", styles['Heading3']))
437
  weight_data = [['Metric', 'Weight']]
438
- for metric, weight in ensemble_result.metric_weights.items():
 
439
  weight_data.append([metric.title(), f"{weight:.1%}"])
440
 
441
  weight_table = Table(weight_data, colWidths=[3*inch, 1*inch])
@@ -578,8 +591,8 @@ class ReportGenerator:
578
 
579
  # Footer
580
  elements.append(Spacer(1, 0.3*inch))
581
- elements.append(Paragraph(f"Generated by AI Text Detector v2.0 | Processing Time: {detection_result.processing_time:.2f}s",
582
- ParagraphStyle('Footer', parent=styles['Normal'], fontSize=8, textColor=colors.gray)))
583
 
584
  # Build PDF
585
  doc.build(elements)
 
79
  --------
80
  { dict } : Dictionary mapping format to filepath
81
  """
82
+ # Convert DetectionResult to dict for consistent access
83
+ detection_dict = detection_result.to_dict() if hasattr(detection_result, 'to_dict') else detection_result
84
+
85
  # Generate detailed reasoning
86
  reasoning = self.reasoning_generator.generate(ensemble_result = detection_result.ensemble_result,
87
  metric_results = detection_result.metric_results,
 
91
  )
92
 
93
  # Extract detailed metrics from ACTUAL detection results
94
+ detailed_metrics = self._extract_detailed_metrics(detection_dict)
95
 
96
  # Timestamp for filenames
97
  timestamp = datetime.now().strftime("%Y%m%d_%H%M%S")
 
100
 
101
  # Generate requested formats
102
  if ("json" in formats):
103
+ json_path = self._generate_json_report(detection_dict = detection_dict,
104
  reasoning = reasoning,
105
  detailed_metrics = detailed_metrics,
106
  attribution_result = attribution_result,
 
111
 
112
  if ("pdf" in formats):
113
  try:
114
+ pdf_path = self._generate_pdf_report(detection_dict = detection_dict,
115
  reasoning = reasoning,
116
  detailed_metrics = detailed_metrics,
117
  attribution_result = attribution_result,
 
129
  return generated_files
130
 
131
 
132
+ def _extract_detailed_metrics(self, detection_dict: Dict) -> List[DetailedMetric]:
133
  """
134
  Extract detailed metrics with sub-metrics from ACTUAL detection result
135
  """
136
  detailed_metrics = list()
137
+ metrics_data = detection_dict.get("metrics", {})
138
+ ensemble_data = detection_dict.get("ensemble", {})
139
 
140
  # Get actual metric weights from ensemble
141
+ metric_weights = ensemble_data.get("metric_contributions", {})
142
 
143
  # Extract actual metric data
144
+ for metric_name, metric_result in metrics_data.items():
145
+ if not isinstance(metric_result, dict):
146
+ continue
147
+
148
+ if metric_result.get("error") is not None:
149
  continue
150
 
151
  # Get actual probabilities and confidence
152
+ ai_prob = metric_result.get("ai_probability", 0) * 100
153
+ human_prob = metric_result.get("human_probability", 0) * 100
154
+ confidence = metric_result.get("confidence", 0) * 100
155
 
156
  # Determine verdict based on actual probability
157
  if (ai_prob >= 60):
 
164
  verdict = "MIXED (AI + HUMAN)"
165
 
166
  # Get actual weight or use default
167
+ weight = 0.0
168
+ if metric_name in metric_weights:
169
+ weight = metric_weights[metric_name].get("weight", 0.0) * 100
170
 
171
  # Extract actual detailed metrics from metric result
172
  detailed_metrics_data = self._extract_metric_details(metric_name = metric_name,
 
190
  return detailed_metrics
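For reference, here is a minimal sketch of the dictionary shape this dict-based helper now assumes, inferred only from the keys it reads (`metrics`, `ensemble.metric_contributions`, and the per-metric probability fields); the real `to_dict()` output may carry additional fields:

```python
# Hypothetical sample payload, shaped to match the .get() calls above.
sample_detection_dict = {
    "metrics": {
        "perplexity": {
            "error": None,
            "ai_probability": 0.72,
            "human_probability": 0.28,
            "confidence": 0.81,
            "score": 0.69,
            "details": {"mean_perplexity": 23.4},
        },
    },
    "ensemble": {
        "final_verdict": "AI-GENERATED",
        "metric_contributions": {"perplexity": {"weight": 0.25}},
        "reasoning": ["Strong agreement across metrics"],
    },
    "analysis": {"domain": "academic", "text_length": 412, "sentence_count": 21},
    "performance": {"total_time": 2.31, "metrics_time": {"perplexity": 0.87}},
    "warnings": [],
    "errors": [],
}

# Defensive access mirrors the helper: missing keys degrade to 0.0 instead of raising.
weight = (sample_detection_dict.get("ensemble", {})
          .get("metric_contributions", {})
          .get("perplexity", {})
          .get("weight", 0.0)) * 100
print(f"perplexity weight: {weight:.1f}%")  # -> 25.0%
```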
191
 
192
 
193
+ def _extract_metric_details(self, metric_name: str, metric_result: Dict) -> Dict[str, float]:
194
  """
195
  Extract detailed sub-metrics from metric result
196
  """
197
  details = dict()
198
 
199
  # Try to get details from metric result
200
+ if metric_result.get("details"):
201
+ details = metric_result["details"].copy()
202
 
203
  # If no details available, provide basic calculated values
204
  if not details:
205
+ details = {"ai_probability" : metric_result.get("ai_probability", 0) * 100,
206
+ "human_probability" : metric_result.get("human_probability", 0) * 100,
207
+ "confidence" : metric_result.get("confidence", 0) * 100,
208
+ "score" : metric_result.get("score", 0) * 100,
209
  }
210
 
211
  return details
 
226
  return descriptions.get(metric_name, "Advanced text analysis metric.")
227
 
228
 
229
+ def _generate_json_report(self, detection_dict: Dict, reasoning: DetailedReasoning, detailed_metrics: List[DetailedMetric],
230
  attribution_result: Optional[AttributionResult], highlighted_sentences: Optional[List] = None, filename: str = None) -> Path:
231
  """
232
  Generate JSON format report with detailed metrics
 
259
  "index" : sent.index,
260
  })
261
 
262
+ # Attribution data
263
  attribution_data = None
264
 
265
  if attribution_result:
 
272
  "metric_contributions": attribution_result.metric_contributions,
273
  }
274
 
275
+ # Use actual detection results from the dictionary
276
+ ensemble_data = detection_dict.get("ensemble", {})
277
+ analysis_data = detection_dict.get("analysis", {})
278
+ metrics_data_dict = detection_dict.get("metrics", {})
279
+ performance_data = detection_dict.get("performance", {})
280
 
281
  report_data = {"report_metadata" : {"generated_at" : datetime.now().isoformat(),
282
  "version" : "1.0.0",
283
  "format" : "json",
284
  "report_id" : filename.replace('.json', ''),
285
  },
286
+ "overall_results" : {"final_verdict" : ensemble_data.get("final_verdict", "Unknown"),
287
+ "ai_probability" : ensemble_data.get("ai_probability", 0),
288
+ "human_probability" : ensemble_data.get("human_probability", 0),
289
+ "mixed_probability" : ensemble_data.get("mixed_probability", 0),
290
+ "overall_confidence" : ensemble_data.get("overall_confidence", 0),
291
+ "uncertainty_score" : ensemble_data.get("uncertainty_score", 0),
292
+ "consensus_level" : ensemble_data.get("consensus_level", 0),
293
+ "domain" : analysis_data.get("domain", "general"),
294
+ "domain_confidence" : analysis_data.get("domain_confidence", 0),
295
+ "text_length" : analysis_data.get("text_length", 0),
296
+ "sentence_count" : analysis_data.get("sentence_count", 0),
297
  },
298
  "ensemble_analysis" : {"method_used" : "confidence_calibrated",
299
+ "metric_weights" : ensemble_data.get("metric_contributions", {}),
300
+ "reasoning" : ensemble_data.get("reasoning", []),
 
301
  },
302
  "detailed_metrics" : metrics_data,
303
  "detection_reasoning" : {"summary" : reasoning.summary,
 
313
  },
314
  "highlighted_text" : highlighted_data,
315
  "model_attribution" : attribution_data,
316
+ "performance_metrics" : {"total_processing_time" : performance_data.get("total_time", 0),
317
+ "metrics_execution_time" : performance_data.get("metrics_time", {}),
318
+ "warnings" : detection_dict.get("warnings", []),
319
+ "errors" : detection_dict.get("errors", []),
320
  }
321
  }
322
 
 
333
  return output_path
334
 
335
 
336
+ def _generate_pdf_report(self, detection_dict: Dict, reasoning: DetailedReasoning, detailed_metrics: List[DetailedMetric],
337
  attribution_result: Optional[AttributionResult], highlighted_sentences: Optional[List] = None, filename: str = None) -> Path:
338
  """
339
  Generate PDF format report with detailed metrics
 
388
  spaceAfter = 8,
389
  )
390
 
391
+ # Use detection results from dictionary
392
+ ensemble_data = detection_dict.get("ensemble", {})
393
+ analysis_data = detection_dict.get("analysis", {})
394
 
395
  # Title and main sections
396
  elements.append(Paragraph("AI Text Detection Analysis Report", title_style))
 
399
 
400
  # Verdict section with ensemble metrics
401
  elements.append(Paragraph("Detection Summary", heading_style))
402
+ verdict_data = [['Final Verdict:', ensemble_data.get("final_verdict", "Unknown")],
403
+ ['AI Probability:', f"{ensemble_data.get('ai_probability', 0):.1%}"],
404
+ ['Human Probability:', f"{ensemble_data.get('human_probability', 0):.1%}"],
405
+ ['Mixed Probability:', f"{ensemble_data.get('mixed_probability', 0):.1%}"],
406
+ ['Overall Confidence:', f"{ensemble_data.get('overall_confidence', 0):.1%}"],
407
+ ['Uncertainty Score:', f"{ensemble_data.get('uncertainty_score', 0):.1%}"],
408
+ ['Consensus Level:', f"{ensemble_data.get('consensus_level', 0):.1%}"],
409
  ]
410
 
411
  verdict_table = Table(verdict_data, colWidths=[2*inch, 3*inch])
 
421
 
422
  # Content analysis
423
  elements.append(Paragraph("Content Analysis", heading_style))
424
+ content_data = [['Content Domain:', analysis_data.get("domain", "general").title()],
425
+ ['Domain Confidence:', f"{analysis_data.get('domain_confidence', 0):.1%}"],
426
+ ['Word Count:', str(analysis_data.get("text_length", 0))],
427
+ ['Sentence Count:', str(analysis_data.get("sentence_count", 0))],
428
+ ['Processing Time:', f"{detection_dict.get('performance', {}).get('total_time', 0):.2f}s"],
429
  ]
430
 
431
  content_table = Table(content_data, colWidths=[2*inch, 3*inch])
 
439
 
440
  # Ensemble Analysis
441
  elements.append(Paragraph("Ensemble Analysis", heading_style))
442
+ elements.append(Paragraph("Method: Confidence Calibrated Aggregation", styles['Normal']))
443
  elements.append(Spacer(1, 0.1*inch))
444
 
445
  # Metric weights table
446
+ metric_contributions = ensemble_data.get("metric_contributions", {})
447
+ if metric_contributions:
448
  elements.append(Paragraph("Metric Weights", styles['Heading3']))
449
  weight_data = [['Metric', 'Weight']]
450
+ for metric, contribution in metric_contributions.items():
451
+ weight = contribution.get("weight", 0)
452
  weight_data.append([metric.title(), f"{weight:.1%}"])
453
 
454
  weight_table = Table(weight_data, colWidths=[3*inch, 1*inch])
 
591
 
592
  # Footer
593
  elements.append(Spacer(1, 0.3*inch))
594
+ elements.append(Paragraph(f"Generated by AI Text Detector v2.0 | Processing Time: {detection_dict.get('performance', {}).get('total_time', 0):.2f}s",
595
+ ParagraphStyle('Footer', parent=styles['Normal'], fontSize=8, textColor=colors.gray)))
596
 
597
  # Build PDF
598
  doc.build(elements)
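The entry-point guard (`detection_result.to_dict() if hasattr(detection_result, 'to_dict') else detection_result`) is what lets both report generators above run unchanged whether the caller passes a `DetectionResult` object or an already-serialized dict. A minimal sketch of that duck-typing, using a hypothetical stand-in class:

```python
# Sketch of the normalization used at the top of the generate path;
# _StubResult is hypothetical and exists only to exercise both branches.
class _StubResult:
    def to_dict(self) -> dict:
        return {"ensemble": {"final_verdict": "HUMAN-WRITTEN"}}

def normalize(detection_result) -> dict:
    if hasattr(detection_result, "to_dict"):
        return detection_result.to_dict()
    return detection_result

assert normalize(_StubResult())["ensemble"]["final_verdict"] == "HUMAN-WRITTEN"
assert normalize({"ensemble": {}}) == {"ensemble": {}}  # dicts pass through untouched
```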
requirements.txt CHANGED
@@ -1,98 +1,56 @@
1
  # Core Framework
2
- fastapi==0.104.1
3
- uvicorn[standard]==0.24.0
4
- pydantic==2.5.0
5
- pydantic-settings==2.1.0
6
- python-multipart==0.0.6
7
 
8
  # Machine Learning & Transformers
9
- torch==2.1.0
10
- transformers==4.35.2
11
- sentence-transformers==2.2.2
12
- tokenizers==0.15.0
 
13
 
14
  # NLP Libraries
15
- spacy==3.7.2
16
- #flair==0.13.1
17
- nltk==3.8.1
18
- textstat==0.7.3
19
 
20
  # Scientific Computing
21
- numpy==1.24.3
22
- scipy==1.11.4
23
- scikit-learn==1.3.2
24
- pandas==2.1.3
25
 
26
  # Text Processing
27
- python-docx==1.1.0
28
  PyPDF2==3.0.1
29
- pdfplumber==0.10.3
30
- pymupdf==1.23.8
31
  python-magic==0.4.27
32
 
33
  # Language Detection
34
  langdetect==1.0.9
35
- #fasttext==0.9.2
36
-
37
- # Adversarial & Robustness
38
- #textattack==0.3.8
39
 
40
  # Visualization & Reporting
41
- matplotlib==3.8.2
42
  seaborn==0.13.0
43
- plotly==5.18.0
44
- reportlab==4.0.7
45
- fpdf2==2.7.6
46
 
47
  # Utilities
48
- python-dotenv==1.0.0
49
  aiofiles==23.2.1
50
- httpx==0.25.2
51
- tenacity==8.2.3
52
 
53
  # Logging & Monitoring
54
- loguru==0.7.2
55
- python-json-logger==2.0.7
56
 
57
  # Caching
58
- redis==5.0.1
59
  diskcache==5.6.3
60
 
61
- # Database (Optional)
62
- sqlalchemy==2.0.23
63
- alembic==1.13.0
64
-
65
- # Testing
66
- pytest==7.4.3
67
- pytest-asyncio==0.21.1
68
- pytest-cov==4.1.0
69
-
70
- # Code Quality
71
- black==23.12.0
72
- flake8==6.1.0
73
- mypy==1.7.1
74
-
75
- # Security
76
- cryptography==41.0.7
77
- python-jose[cryptography]==3.3.0
78
-
79
- # Performance
80
- orjson==3.9.10
81
- ujson==5.9.0
82
-
83
- # Additional ML Tools
84
- xgboost==2.0.2
85
- lightgbm==4.1.0
86
-
87
- # Dimensionality Analysis
88
- #scikit-dimension==0.3.5
89
- umap-learn==0.5.5
90
-
91
- # Rate Limiting
92
- slowapi==0.1.9
93
-
94
- # CORS
95
- fastapi-cors==0.0.6
96
-
97
- # File type detection
98
- python-magic-bin==0.4.14
 
1
  # Core Framework
2
+ fastapi==0.115.6
3
+ uvicorn==0.34.0
4
+ pydantic==2.11.4
5
+ pydantic-settings==2.11.0
6
+ python-multipart==0.0.20
7
 
8
  # Machine Learning & Transformers
9
+ torch==2.3.1
10
+ transformers==4.48.0
11
+ sentence-transformers==3.3.1
12
+ tokenizers==0.21.0
13
+ huggingface-hub==0.27.0
14
 
15
  # NLP Libraries
16
+ spacy==3.8.3
17
+ nltk==3.9.1
18
+ textstat==0.7.10
 
19
 
20
  # Scientific Computing
21
+ numpy==1.23.5
22
+ scipy==1.12.0
23
+ scikit-learn==1.6.0
24
+ pandas==2.2.3
25
 
26
  # Text Processing
27
+ python-docx==1.1.2
28
  PyPDF2==3.0.1
29
+ pdfplumber==0.11.5
30
+ pymupdf==1.25.5
31
  python-magic==0.4.27
32
 
33
  # Language Detection
34
  langdetect==1.0.9
 
 
 
 
35
 
36
  # Visualization & Reporting
37
+ matplotlib==3.8.0
38
  seaborn==0.13.0
39
+ reportlab==4.2.2
 
 
40
 
41
  # Utilities
42
+ python-dotenv==1.0.1
43
  aiofiles==23.2.1
44
+ httpx==0.27.0
45
+ tenacity==9.1.2
46
 
47
  # Logging & Monitoring
48
+ loguru==0.7.3
 
49
 
50
  # Caching
 
51
  diskcache==5.6.3
52
 
53
+ # Additional runtime dependencies for model loading & serialization
54
+ safetensors==0.4.4
55
+ accelerate==1.2.1
56
+ protobuf==4.25.4
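Several of these pins tend to move together (transformers / tokenizers / safetensors in particular), so a cheap startup check against the pinned versions can surface a drifted environment before model loading fails. A minimal stdlib-only sketch:

```python
# Sanity-check a few of the pins above at startup (sketch; extend PINS as needed).
from importlib.metadata import PackageNotFoundError, version

PINS = {"fastapi": "0.115.6", "torch": "2.3.1", "transformers": "4.48.0"}

def check_pins(pins: dict) -> None:
    for name, expected in pins.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            print(f"MISSING: {name} (pinned {expected})")
            continue
        if installed != expected:
            print(f"DRIFT: {name} {installed} installed, {expected} pinned")

check_pins(PINS)
```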
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
setup.sh ADDED
@@ -0,0 +1,22 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ #!/bin/bash
2
+
3
+ # Post-installation setup script for Hugging Face Spaces
4
+ echo "Starting setup for Text-Authentication Platform ..."
5
+
6
+ # Download spaCy model
7
+ echo "Downloading spaCy English model ..."
8
+ python -m spacy download en_core_web_sm
9
+
10
+ # Download NLTK data
11
+ echo "Downloading NLTK data ..."
12
+ python -c "import nltk; nltk.download('punkt'); nltk.download('stopwords'); nltk.download('averaged_perceptron_tagger')"
13
+
14
+ # Create necessary directories
15
+ echo "Creating directories ..."
16
+ mkdir -p data/reports data/uploads
17
+
18
+ # Verify installation
19
+ echo "Verifying installations ..."
20
+ python -c "import transformers; import torch; import spacy; print('All core libraries imported successfully.')"
21
+
22
+ echo "Setup complete !"
text_auth_app.py CHANGED
@@ -1245,8 +1245,6 @@ async def log_requests(request: Request, call_next):
1245
  return response
1246
 
1247
 
1248
-
1249
-
1250
  # ==================== MAIN ====================
1251
  if __name__ == "__main__":
1252
  # Configure logging
 
1245
  return response
1246
 
1247
 
 
 
1248
  # ==================== MAIN ====================
1249
  if __name__ == "__main__":
1250
  # Configure logging
ui/static/index.html CHANGED
@@ -273,7 +273,6 @@ body {
273
  padding: 2rem;
274
  border: 1px solid var(--border);
275
  backdrop-filter: blur(10px);
276
- /* Changed from fixed height to use available space */
277
  height: 850px;
278
  overflow: hidden;
279
  display: flex;
@@ -621,7 +620,7 @@ input[type="checkbox"] {
621
  color: var(--text-secondary);
622
  line-height: 1.7;
623
  }
624
- /* Enhanced Reasoning Styles */
625
  .reasoning-box.enhanced {
626
  background: linear-gradient(135deg, rgba(30, 41, 59, 0.95) 0%, rgba(15, 23, 42, 0.95) 100%);
627
  border: 1px solid rgba(71, 85, 105, 0.5);
@@ -703,7 +702,7 @@ input[type="checkbox"] {
703
  .metric-indicator {
704
  display: flex;
705
  justify-content: space-between;
706
- align-items: center;
707
  padding: 0.75rem;
708
  margin-bottom: 0.5rem;
709
  border-radius: 8px;
@@ -714,7 +713,7 @@ input[type="checkbox"] {
714
  transform: translateX(4px);
715
  }
716
  .metric-name {
717
- font-weight: 600;
718
  color: var(--text-primary);
719
  min-width: 140px;
720
  }
@@ -795,6 +794,23 @@ input[type="checkbox"] {
795
  font-weight: 700;
796
  color: var(--primary);
797
  }
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
798
  /* Download Actions */
799
  .download-actions {
800
  display: flex;
@@ -959,21 +975,16 @@ input[type="checkbox"] {
959
  }
960
  .metrics-carousel-content {
961
  flex: 1;
962
- /* Removed padding and centering to allow content to fill space */
963
  padding: 0;
964
- /* Removed align-items: center; justify-content: center; to let content take natural space */
965
  display: flex;
966
  align-items: flex-start;
967
  justify-content: flex-start;
968
  overflow-y: auto;
969
- /* Added some internal spacing for readability */
970
  padding: 1rem;
971
- /* min-height: 600px; */
972
  }
973
  .metric-slide {
974
  display: none;
975
  width: 100%;
976
- /* Reduced padding to make card tighter */
977
  padding: 1rem;
978
  }
979
  .metric-slide.active {
@@ -1011,6 +1022,43 @@ input[type="checkbox"] {
1011
  color: var(--text-secondary);
1012
  font-weight: 600;
1013
  }
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1014
  /* Responsive */
1015
  @media (max-width: 1200px) {
1016
  .interface-grid {
@@ -1222,7 +1270,7 @@ html {
1222
  id="text-input"
1223
  class="text-input"
1224
  placeholder="Paste your text here for analysis...
1225
- The more text you provide (minimum 50 characters), the more accurate the detection will be. Our system analyzes linguistic patterns, statistical features, and semantic structures to determine authenticity."
1226
  ></textarea>
1227
  </div>
1228
  <div id="upload-tab" class="tab-content">
@@ -1351,18 +1399,21 @@ const API_BASE = '';
1351
  let currentAnalysisData = null;
1352
  let currentMetricIndex = 0;
1353
  let totalMetrics = 0;
 
1354
  // Navigation
1355
  function showLanding() {
1356
  document.getElementById('landing-page').style.display = 'block';
1357
  document.getElementById('analysis-interface').style.display = 'none';
1358
  window.scrollTo(0, 0);
1359
  }
 
1360
  function showAnalysis() {
1361
  document.getElementById('landing-page').style.display = 'none';
1362
  document.getElementById('analysis-interface').style.display = 'block';
1363
  window.scrollTo(0, 0);
1364
  resetAnalysisInterface();
1365
  }
 
1366
  // Reset analysis interface
1367
  function resetAnalysisInterface() {
1368
  // Clear text input
@@ -1419,6 +1470,7 @@ function resetAnalysisInterface() {
1419
  currentMetricIndex = 0;
1420
  totalMetrics = 0;
1421
  }
 
1422
  // Input Tab Switching
1423
  document.querySelectorAll('.input-tab').forEach(tab => {
1424
  tab.addEventListener('click', () => {
@@ -1431,6 +1483,7 @@ document.querySelectorAll('.input-tab').forEach(tab => {
1431
  document.getElementById(`${tabName}-tab`).classList.add('active');
1432
  });
1433
  });
 
1434
  // Report Tab Switching
1435
  document.querySelectorAll('.report-tab').forEach(tab => {
1436
  tab.addEventListener('click', () => {
@@ -1443,24 +1496,30 @@ document.querySelectorAll('.report-tab').forEach(tab => {
1443
  document.getElementById(`${reportName}-report`).classList.add('active');
1444
  });
1445
  });
 
1446
  // File Upload Handling
1447
  const fileInput = document.getElementById('file-input');
1448
  const fileUploadArea = document.getElementById('file-upload-area');
1449
  const fileNameDisplay = document.getElementById('file-name-display');
 
1450
  fileUploadArea.addEventListener('click', () => {
1451
  fileInput.click();
1452
  });
 
1453
  fileInput.addEventListener('change', (e) => {
1454
  handleFileSelect(e.target.files[0]);
1455
  });
 
1456
  // Drag and Drop
1457
  fileUploadArea.addEventListener('dragover', (e) => {
1458
  e.preventDefault();
1459
  fileUploadArea.classList.add('drag-over');
1460
  });
 
1461
  fileUploadArea.addEventListener('dragleave', () => {
1462
  fileUploadArea.classList.remove('drag-over');
1463
  });
 
1464
  fileUploadArea.addEventListener('drop', (e) => {
1465
  e.preventDefault();
1466
  fileUploadArea.classList.remove('drag-over');
@@ -1470,6 +1529,7 @@ fileUploadArea.addEventListener('drop', (e) => {
1470
  handleFileSelect(file);
1471
  }
1472
  });
 
1473
  function handleFileSelect(file) {
1474
  if (!file) return;
1475
  const allowedTypes = ['.txt', '.pdf', '.docx', '.doc', '.md'];
@@ -1488,16 +1548,19 @@ function handleFileSelect(file) {
1488
  <span style="color: var(--text-muted);">(${formatFileSize(file.size)})</span>
1489
  `;
1490
  }
 
1491
  function formatFileSize(bytes) {
1492
  if (bytes < 1024) return bytes + ' B';
1493
  if (bytes < 1024 * 1024) return (bytes / 1024).toFixed(1) + ' KB';
1494
  return (bytes / (1024 * 1024)).toFixed(1) + ' MB';
1495
  }
 
1496
  // Analyze Button
1497
  document.getElementById('analyze-btn').addEventListener('click', async () => {
1498
  const activeTab = document.querySelector('.input-tab.active').dataset.tab;
1499
  const textInput = document.getElementById('text-input').value.trim();
1500
  const fileInput = document.getElementById('file-input').files[0];
 
1501
  if (activeTab === 'paste' && !textInput) {
1502
  alert('Please paste some text to analyze (minimum 50 characters).');
1503
  return;
@@ -1510,21 +1573,26 @@ document.getElementById('analyze-btn').addEventListener('click', async () => {
1510
  alert('Please select a file to upload.');
1511
  return;
1512
  }
 
1513
  await performAnalysis(activeTab, textInput, fileInput);
1514
  });
 
1515
  // Refresh Button - clears everything and shows empty state
1516
  document.getElementById('refresh-btn').addEventListener('click', () => {
1517
  resetAnalysisInterface();
1518
  });
 
1519
  // Try Next Button - same as refresh but keeps the interface ready
1520
  document.getElementById('try-next-btn').addEventListener('click', () => {
1521
  resetAnalysisInterface();
1522
  });
 
1523
  async function performAnalysis(mode, text, file) {
1524
  const analyzeBtn = document.getElementById('analyze-btn');
1525
  analyzeBtn.disabled = true;
1526
  analyzeBtn.innerHTML = '⏳ Analyzing...';
1527
  showLoading();
 
1528
  try {
1529
  let response;
1530
  if (mode === 'paste') {
@@ -1542,12 +1610,14 @@ async function performAnalysis(mode, text, file) {
1542
  analyzeBtn.innerHTML = '🔍 Analyze Text';
1543
  }
1544
  }
 
1545
  async function analyzeText(text) {
1546
  const domain = document.getElementById('domain-select').value || null;
1547
  const enableAttribution = document.getElementById('enable-attribution').checked;
1548
  const enableHighlighting = document.getElementById('enable-highlighting').checked;
1549
  const useSentenceLevel = document.getElementById('use-sentence-level').checked;
1550
  const includeMetricsSummary = document.getElementById('include-metrics-summary').checked;
 
1551
  const response = await fetch(`${API_BASE}/api/analyze`, {
1552
  method: 'POST',
1553
  headers: { 'Content-Type': 'application/json' },
@@ -1561,17 +1631,20 @@ async function analyzeText(text) {
1561
  skip_expensive_metrics: false
1562
  })
1563
  });
 
1564
  if (!response.ok) {
1565
  const error = await response.json();
1566
  throw new Error(error.error || 'Analysis failed');
1567
  }
1568
  return await response.json();
1569
  }
 
1570
  async function analyzeFile(file) {
1571
  const domain = document.getElementById('domain-select').value || null;
1572
  const enableAttribution = document.getElementById('enable-attribution').checked;
1573
  const useSentenceLevel = document.getElementById('use-sentence-level').checked;
1574
  const includeMetricsSummary = document.getElementById('include-metrics-summary').checked;
 
1575
  const formData = new FormData();
1576
  formData.append('file', file);
1577
  if (domain) formData.append('domain', domain);
@@ -1579,16 +1652,19 @@ async function analyzeFile(file) {
1579
  formData.append('use_sentence_level', useSentenceLevel.toString());
1580
  formData.append('include_metrics_summary', includeMetricsSummary.toString());
1581
  formData.append('skip_expensive_metrics', 'false');
 
1582
  const response = await fetch(`${API_BASE}/api/analyze/file`, {
1583
  method: 'POST',
1584
  body: formData
1585
  });
 
1586
  if (!response.ok) {
1587
  const error = await response.json();
1588
  throw new Error(error.error || 'File analysis failed');
1589
  }
1590
  return await response.json();
1591
  }
 
1592
  function showLoading() {
1593
  document.getElementById('summary-report').innerHTML = `
1594
  <div class="loading">
@@ -1600,6 +1676,7 @@ function showLoading() {
1600
  </div>
1601
  `;
1602
  }
 
1603
  function showError(message) {
1604
  document.getElementById('summary-report').innerHTML = `
1605
  <div class="empty-state">
@@ -1609,6 +1686,7 @@ function showError(message) {
1609
  </div>
1610
  `;
1611
  }
 
1612
  function displayResults(data) {
1613
  console.log('Response data:', data);
1614
  // Handle different response structures
@@ -1618,13 +1696,16 @@ function displayResults(data) {
1618
  console.error('Full response:', data);
1619
  return;
1620
  }
 
1621
  // Extract data based on your actual API structure
1622
  const ensemble = detection.ensemble_result || detection.ensemble;
1623
  const prediction = detection.prediction || {};
1624
  const metrics = detection.metric_results || detection.metrics;
1625
  const analysis = detection.analysis || {};
 
1626
  // Display Summary with enhanced reasoning
1627
  displaySummary(ensemble, prediction, analysis, data.attribution, data.reasoning);
 
1628
  // Display Highlighted Text with enhanced features
1629
  if (data.highlighted_html) {
1630
  displayHighlightedText(data.highlighted_html);
@@ -1635,6 +1716,7 @@ function displayResults(data) {
1635
  </div>
1636
  `;
1637
  }
 
1638
  // Display Metrics with carousel
1639
  if (metrics && Object.keys(metrics).length > 0) {
1640
  displayMetricsCarousel(metrics, analysis, ensemble);
@@ -1646,10 +1728,48 @@ function displayResults(data) {
1646
  `;
1647
  }
1648
  }
 
1649
  function displaySummary(ensemble, prediction, analysis, attribution, reasoning) {
1650
- // Use ensemble values from your actual API response
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1651
  const aiProbability = ensemble.ai_probability !== undefined ?
1652
  (ensemble.ai_probability * 100).toFixed(0) : '0';
 
 
 
 
 
 
 
1653
  const verdict = ensemble.final_verdict || 'Unknown';
1654
  const confidence = ensemble.overall_confidence !== undefined ?
1655
  (ensemble.overall_confidence * 100).toFixed(1) : '0';
@@ -1657,81 +1777,289 @@ function displaySummary(ensemble, prediction, analysis, attribution, reasoning)
1657
  const isAI = verdict.toLowerCase().includes('ai');
1658
  const gaugeColor = isAI ? 'var(--danger)' : 'var(--success)';
1659
  const gaugeDegree = aiProbability * 3.6;
1660
- const confidenceLevel = parseFloat(confidence) >= 70 ? 'HIGH' :
1661
- parseFloat(confidence) >= 40 ? 'MEDIUM' : 'LOW';
1662
- const confidenceClass = confidenceLevel === 'HIGH' ? 'confidence-high' :
1663
- confidenceLevel === 'MEDIUM' ? 'confidence-medium' : 'confidence-low';
1664
- let attributionHTML = '';
1665
- if (attribution && attribution.predicted_model) {
1666
- const modelName = attribution.predicted_model.replace(/_/g, ' ').replace(/-/g, ' ').toUpperCase();
1667
- const modelConf = attribution.confidence ?
1668
- (attribution.confidence * 100).toFixed(1) : 'N/A';
1669
- let topModels = '';
1670
- if (attribution.model_probabilities) {
1671
- const sorted = Object.entries(attribution.model_probabilities)
1672
- .sort((a, b) => b[1] - a[1])
1673
- .slice(0, 3);
1674
- topModels = sorted.map(([model, prob]) =>
1675
- `<div class="model-match" style="margin-top: 0.5rem;">
1676
- <span class="model-name">${model.replace(/_/g, ' ').replace(/-/g, ' ').toUpperCase()}</span>
1677
- <span class="model-confidence">${(prob * 100).toFixed(1)}%</span>
1678
- </div>`
1679
- ).join('');
1680
- }
1681
- attributionHTML = `
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1682
  <div class="attribution-section">
1683
  <div class="attribution-title">🤖 AI Model Attribution</div>
1684
- ${topModels}
1685
- ${attribution.reasoning && attribution.reasoning.length > 0 ?
1686
- `<p style="color: var(--text-secondary); margin-top: 1rem; font-size: 0.9rem;">${attribution.reasoning[0]}</p>` : ''}
 
 
1687
  </div>
1688
  `;
1689
  }
1690
- document.getElementById('summary-report').innerHTML = `
1691
- <div class="result-summary">
1692
- <div class="gauge-container">
1693
- <div class="gauge-circle" style="--gauge-color: ${gaugeColor}; --gauge-degree: ${gaugeDegree}deg;">
1694
- <div class="gauge-inner">
1695
- <div class="gauge-value">${aiProbability}%</div>
1696
- <div class="gauge-label">AI Probability</div>
1697
- </div>
1698
- </div>
 
 
 
 
 
 
 
 
 
 
 
1699
  </div>
1700
- <div class="result-info-grid">
1701
- <div class="info-card">
1702
- <div class="info-label">Verdict</div>
1703
- <div class="info-value" style="font-size: 1.2rem;">${verdict}</div>
1704
- </div>
1705
- <div class="info-card">
1706
- <div class="info-label">Confidence Level</div>
1707
- <div class="info-value">
1708
- <span class="confidence-badge ${confidenceClass}">${confidence}%</span>
1709
- </div>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1710
  </div>
1711
- <div class="info-card">
1712
- <div class="info-label">Content Domain</div>
1713
- <div class="info-value" style="font-size: 1.1rem;">${formatDomainName(domain)}</div>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1714
  </div>
1715
  </div>
1716
- ${createEnhancedReasoningHTML(ensemble, analysis, reasoning)}
1717
- ${attributionHTML}
1718
- <div class="download-actions">
1719
- <button class="download-btn" onclick="downloadReport('json')">
1720
- 📄 Download JSON
1721
- </button>
1722
- <button class="download-btn" onclick="downloadReport('pdf')">
1723
- 📑 Download PDF Report
1724
- </button>
1725
  </div>
1726
  </div>
1727
  `;
1728
  }
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1729
  function createEnhancedReasoningHTML(ensemble, analysis, reasoning) {
1730
- // Use actual reasoning data if available
1731
  if (reasoning && reasoning.summary) {
1732
- // Process markdown-style *text* to <strong> tags
1733
- let processedSummary = reasoning.summary;
1734
- processedSummary = processedSummary.replace(/\*([^*]+)\*/g, '<strong>$1</strong>');
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1735
  return `
1736
  <div class="reasoning-box enhanced">
1737
  <div class="reasoning-header">
@@ -1745,23 +2073,41 @@ function createEnhancedReasoningHTML(ensemble, analysis, reasoning) {
1745
  <div class="verdict-text">${ensemble.final_verdict}</div>
1746
  <div class="probability">AI Probability: <span class="probability-value">${(ensemble.ai_probability * 100).toFixed(2)}%</span></div>
1747
  </div>
1748
- <div class="reasoning-text-content">
1749
- ${processedSummary}
1750
  </div>
1751
- ${reasoning.key_indicators && reasoning.key_indicators.length > 0 ? `
1752
  <div class="metrics-breakdown">
1753
- <div class="breakdown-header">Key Indicators</div>
1754
- ${reasoning.key_indicators.map(indicator => {
1755
- let processedIndicator = indicator;
1756
- processedIndicator = processedIndicator.replace(/\*([^*]+)\*/g, '<strong>$1</strong>');
1757
- return `
1758
- <div class="metric-indicator">
1759
- <div class="metric-name">${processedIndicator.split(':')[0]}</div>
1760
- <div class="metric-details">
1761
- <span class="reasoning-text-content">${processedIndicator.split(':')[1]}</span>
 
 
 
 
 
 
 
 
 
1762
  </div>
1763
- </div>
1764
- `;
 
 
 
 
 
 
 
 
 
1765
  }).join('')}
1766
  </div>
1767
  ` : ''}
@@ -1778,7 +2124,7 @@ function createEnhancedReasoningHTML(ensemble, analysis, reasoning) {
1778
  return `
1779
  <div class="reasoning-box">
1780
  <div class="reasoning-title">💡 Detection Reasoning</div>
1781
- <p class="reasoning-text">
1782
  Analysis based on 6-metric ensemble with domain-aware calibration.
1783
  The system evaluated linguistic patterns, statistical features, and semantic structures
1784
  to determine content authenticity with ${(ensemble.overall_confidence * 100).toFixed(1)}% confidence.
@@ -1786,6 +2132,58 @@ function createEnhancedReasoningHTML(ensemble, analysis, reasoning) {
1786
  </div>
1787
  `;
1788
  }
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1789
  function displayHighlightedText(html) {
1790
  document.getElementById('highlighted-report').innerHTML = `
1791
  ${createDefaultLegend()}
@@ -1795,6 +2193,7 @@ function displayHighlightedText(html) {
1795
  ${getHighlightStyles()}
1796
  `;
1797
  }
 
1798
  function createDefaultLegend() {
1799
  return `
1800
  <div class="highlight-legend">
@@ -1833,6 +2232,7 @@ function createDefaultLegend() {
1833
  </div>
1834
  `;
1835
  }
 
1836
  function getHighlightStyles() {
1837
  return `
1838
  <style>
@@ -1888,10 +2288,12 @@ function getHighlightStyles() {
1888
  </style>
1889
  `;
1890
  }
 
1891
  function displayMetricsCarousel(metrics, analysis, ensemble) {
1892
  const metricOrder = ['structural', 'perplexity', 'entropy', 'semantic_analysis', 'linguistic', 'multi_perturbation_stability'];
1893
  const availableMetrics = metricOrder.filter(key => metrics[key]);
1894
  totalMetrics = availableMetrics.length;
 
1895
  if (totalMetrics === 0) {
1896
  document.getElementById('metrics-report').innerHTML = `
1897
  <div class="empty-state">
@@ -1900,24 +2302,39 @@ function displayMetricsCarousel(metrics, analysis, ensemble) {
1900
  `;
1901
  return;
1902
  }
 
1903
  let carouselHTML = `
1904
  <div class="metrics-carousel-container">
1905
  <div class="metrics-carousel-content">
1906
  `;
 
1907
  availableMetrics.forEach((metricKey, index) => {
1908
  const metric = metrics[metricKey];
1909
  if (!metric) return;
 
1910
  const aiProb = (metric.ai_probability * 100).toFixed(1);
1911
  const humanProb = (metric.human_probability * 100).toFixed(1);
 
1912
  const confidence = (metric.confidence * 100).toFixed(1);
1913
  const weight = ensemble.metric_contributions && ensemble.metric_contributions[metricKey] ?
1914
- (ensemble.metric_contributions[metricKey].weight * 100).toFixed(1) : '0.0';
1915
- const color = metric.ai_probability >= 0.6 ? 'var(--danger)' :
1916
- metric.ai_probability >= 0.4 ? 'var(--warning)' : 'var(--success)';
1917
- const verdictText = metric.ai_probability >= 0.6 ? 'AI' :
1918
- metric.ai_probability >= 0.4 ? 'UNCERTAIN' : 'HUMAN';
1919
- const verdictClass = verdictText === 'AI' ? 'verdict-ai' :
1920
- verdictText === 'UNCERTAIN' ? 'verdict-uncertain' : 'verdict-human';
 
 
 
 
 
 
 
 
 
 
 
1921
  carouselHTML += `
1922
  <div class="metric-slide ${index === 0 ? 'active' : ''}" data-metric-index="${index}">
1923
  <div class="metric-result-card">
@@ -1927,22 +2344,32 @@ function displayMetricsCarousel(metrics, analysis, ensemble) {
1927
  <div class="metric-description">
1928
  ${getMetricDescription(metricKey)}
1929
  </div>
1930
- <div style="display: flex; gap: 1rem; margin: 1rem 0;">
1931
- <div style="flex: 1;">
 
 
1932
  <div style="font-size: 0.75rem; color: var(--text-muted); margin-bottom: 0.25rem;">AI</div>
1933
  <div style="background: rgba(51, 65, 85, 0.5); height: 8px; border-radius: 4px; overflow: hidden;">
1934
  <div style="background: var(--danger); height: 100%; width: ${aiProb}%; transition: width 0.5s;"></div>
1935
  </div>
1936
  <div style="font-size: 0.85rem; font-weight: 600; margin-top: 0.25rem;">${aiProb}%</div>
1937
  </div>
1938
- <div style="flex: 1;">
1939
  <div style="font-size: 0.75rem; color: var(--text-muted); margin-bottom: 0.25rem;">Human</div>
1940
  <div style="background: rgba(51, 65, 85, 0.5); height: 8px; border-radius: 4px; overflow: hidden;">
1941
  <div style="background: var(--success); height: 100%; width: ${humanProb}%; transition: width 0.5s;"></div>
1942
  </div>
1943
  <div style="font-size: 0.85rem; font-weight: 600; margin-top: 0.25rem;">${humanProb}%</div>
1944
  </div>
 
 
 
 
 
 
 
1945
  </div>
 
1946
  <div style="display: flex; justify-content: space-between; align-items: center; margin: 0.75rem 0;">
1947
  <span class="metric-verdict ${verdictClass}">${verdictText}</span>
1948
  <span style="font-size: 0.85rem; color: var(--text-secondary);">Confidence: ${confidence}% | Weight: ${weight}%</span>
@@ -1953,6 +2380,7 @@ function displayMetricsCarousel(metrics, analysis, ensemble) {
1953
  </div>
1954
  `;
1955
  });
 
1956
  carouselHTML += `
1957
  </div>
1958
  <div class="metrics-carousel-nav">
@@ -1962,9 +2390,11 @@ function displayMetricsCarousel(metrics, analysis, ensemble) {
1962
  </div>
1963
  </div>
1964
  `;
 
1965
  document.getElementById('metrics-report').innerHTML = carouselHTML;
1966
  updateCarouselButtons();
1967
  }
 
1968
  function navigateMetrics(direction) {
1969
  const newMetricIndex = currentMetricIndex + direction;
1970
  if (newMetricIndex >= 0 && newMetricIndex < totalMetrics) {
@@ -1972,6 +2402,7 @@ function navigateMetrics(direction) {
1972
  updateMetricCarousel();
1973
  }
1974
  }
 
1975
  function updateMetricCarousel() {
1976
  const slides = document.querySelectorAll('.metric-slide');
1977
  slides.forEach((slide, index) => {
@@ -1988,6 +2419,7 @@ function updateMetricCarousel() {
1988
  positionElement.textContent = `${currentMetricIndex + 1} / ${totalMetrics}`;
1989
  }
1990
  }
 
1991
  function updateCarouselButtons() {
1992
  const prevBtn = document.querySelector('.prev-btn');
1993
  const nextBtn = document.querySelector('.next-btn');
@@ -1998,8 +2430,10 @@ function updateCarouselButtons() {
1998
  nextBtn.disabled = currentMetricIndex === totalMetrics - 1;
1999
  }
2000
  }
 
2001
  function renderMetricDetails(metricName, details) {
2002
  if (!details || Object.keys(details).length === 0) return '';
 
2003
  // Key metrics to show for each type
2004
  const importantKeys = {
2005
  'structural': ['burstiness_score', 'length_uniformity', 'avg_sentence_length', 'std_sentence_length'],
@@ -2007,29 +2441,47 @@ function renderMetricDetails(metricName, details) {
2007
  'entropy': ['token_diversity', 'sequence_unpredictability', 'char_entropy'],
2008
  'semantic_analysis': ['coherence_score', 'consistency_score', 'repetition_score'],
2009
  'linguistic': ['pos_diversity', 'syntactic_complexity', 'grammatical_consistency'],
2010
- 'multi_perturbation_stability': ['stability_score', 'curvature_score', 'likelihood_ratio']
2011
  };
 
2012
  const keysToShow = importantKeys[metricName] || Object.keys(details).slice(0, 6);
 
2013
  let detailsHTML = '<div style="margin-top: 1rem; padding-top: 1rem; border-top: 1px solid var(--border);">';
2014
  detailsHTML += '<div style="font-size: 0.9rem; font-weight: 600; color: var(--text-secondary); margin-bottom: 0.75rem;">📈 Detailed Metrics:</div>';
2015
  detailsHTML += '<div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(180px, 1fr)); gap: 0.75rem; font-size: 0.85rem;">';
 
2016
  keysToShow.forEach(key => {
2017
  if (details[key] !== undefined && details[key] !== null) {
2018
- const value = typeof details[key] === 'number' ?
2019
- (details[key] < 1 && details[key] > 0 ? (details[key] * 100).toFixed(2) + '%' : details[key].toFixed(2)) :
2020
- details[key];
 
 
 
 
 
 
 
 
 
 
 
 
 
2021
  const label = key.replace(/_/g, ' ').replace(/\b\w/g, c => c.toUpperCase());
2022
  detailsHTML += `
2023
  <div style="background: rgba(15, 23, 42, 0.6); padding: 0.5rem; border-radius: 6px;">
2024
  <div style="color: var(--text-muted); font-size: 0.75rem; margin-bottom: 0.25rem;">${label}</div>
2025
- <div style="color: var(--primary); font-weight: 700;">${value}</div>
2026
  </div>
2027
  `;
2028
  }
2029
  });
 
2030
  detailsHTML += '</div></div>';
2031
  return detailsHTML;
2032
  }
 
2033
  function getMetricDescription(metricName) {
2034
  const descriptions = {
2035
  structural: 'Analyzes sentence structure, length patterns, and statistical features.',
@@ -2041,6 +2493,7 @@ function getMetricDescription(metricName) {
2041
  };
2042
  return descriptions[metricName] || 'Metric analysis complete.';
2043
  }
 
2044
  function formatMetricName(name) {
2045
  const names = {
2046
  structural: 'Structural Analysis',
@@ -2052,17 +2505,17 @@ function formatMetricName(name) {
2052
  };
2053
  return names[name] || name.split('_').map(w => w.charAt(0).toUpperCase() + w.slice(1)).join(' ');
2054
  }
2055
- function formatDomainName(domain) {
2056
- return domain.split('_').map(w => w.charAt(0).toUpperCase() + w.slice(1)).join(' ');
2057
- }
2058
  async function downloadReport(format) {
2059
  if (!currentAnalysisData) {
2060
  alert('No analysis data available');
2061
  return;
2062
  }
 
2063
  try {
2064
  const analysisId = currentAnalysisData.analysis_id;
2065
  const timestamp = new Date().toISOString().replace(/[:.]/g, '-');
 
2066
  // For JSON, download directly from current data
2067
  if (format === 'json') {
2068
  const data = {
@@ -2077,6 +2530,7 @@ async function downloadReport(format) {
2077
  await downloadBlob(blob, filename);
2078
  return;
2079
  }
 
2080
  // Get the original text for report generation
2081
  const activeTab = document.querySelector('.input-tab.active').dataset.tab;
2082
  let textToSend = '';
@@ -2086,19 +2540,23 @@ async function downloadReport(format) {
2086
  textToSend = currentAnalysisData.detection_result?.processed_text?.text ||
2087
  'Uploaded file content - see analysis for details';
2088
  }
 
2089
  // For PDF, request from server
2090
  const formData = new FormData();
2091
  formData.append('analysis_id', analysisId);
2092
  formData.append('text', textToSend);
2093
  formData.append('formats', format);
2094
  formData.append('include_highlights', document.getElementById('enable-highlighting').checked.toString());
 
2095
  const response = await fetch(`${API_BASE}/api/report/generate`, {
2096
  method: 'POST',
2097
  body: formData
2098
  });
 
2099
  if (!response.ok) {
2100
  throw new Error('Report generation failed');
2101
  }
 
2102
  const result = await response.json();
2103
  if (result.reports && result.reports[format]) {
2104
  const filename = result.reports[format];
@@ -2117,6 +2575,7 @@ async function downloadReport(format) {
2117
  alert('Failed to download report. Please try again.');
2118
  }
2119
  }
 
2120
  async function downloadBlob(blob, filename) {
2121
  try {
2122
  const url = URL.createObjectURL(blob);
@@ -2136,6 +2595,7 @@ async function downloadBlob(blob, filename) {
2136
  alert('Download failed. Please try again.');
2137
  }
2138
  }
 
2139
  function showDownloadSuccess(filename) {
2140
  const notification = document.createElement('div');
2141
  notification.style.cssText = `
@@ -2158,6 +2618,7 @@ function showDownloadSuccess(filename) {
2158
  </div>
2159
  `;
2160
  document.body.appendChild(notification);
 
2161
  if (!document.querySelector('#download-animation')) {
2162
  const style = document.createElement('style');
2163
  style.id = 'download-animation';
@@ -2169,12 +2630,14 @@ function showDownloadSuccess(filename) {
2169
  `;
2170
  document.head.appendChild(style);
2171
  }
 
2172
  setTimeout(() => {
2173
  if (notification.parentNode) {
2174
  notification.parentNode.removeChild(notification);
2175
  }
2176
  }, 3000);
2177
  }
 
2178
  // Smooth scrolling for anchor links
2179
  document.querySelectorAll('a[href^="#"]').forEach(anchor => {
2180
  anchor.addEventListener('click', function (e) {
@@ -2188,6 +2651,7 @@ document.querySelectorAll('a[href^="#"]').forEach(anchor => {
2188
  }
2189
  });
2190
  });
 
2191
  // Initialize - show landing page by default
2192
  showLanding();
2193
  </script>
 
273
  padding: 2rem;
274
  border: 1px solid var(--border);
275
  backdrop-filter: blur(10px);
 
276
  height: 850px;
277
  overflow: hidden;
278
  display: flex;
 
620
  color: var(--text-secondary);
621
  line-height: 1.7;
622
  }
623
+ /* Reasoning Styles */
624
  .reasoning-box.enhanced {
625
  background: linear-gradient(135deg, rgba(30, 41, 59, 0.95) 0%, rgba(15, 23, 42, 0.95) 100%);
626
  border: 1px solid rgba(71, 85, 105, 0.5);
 
702
  .metric-indicator {
703
  display: flex;
704
  justify-content: space-between;
705
+ align-items: flex-start;
706
  padding: 0.75rem;
707
  margin-bottom: 0.5rem;
708
  border-radius: 8px;
 
713
  transform: translateX(4px);
714
  }
715
  .metric-name {
716
+ font-weight: 400;
717
  color: var(--text-primary);
718
  min-width: 140px;
719
  }
 
794
  font-weight: 700;
795
  color: var(--primary);
796
  }
797
+ .attribution-confidence {
798
+ margin-top: 0.75rem;
799
+ font-size: 0.85rem;
800
+ color: var(--text-secondary);
801
+ }
802
+ .attribution-uncertain {
803
+ color: var(--text-muted);
804
+ font-style: italic;
805
+ margin-top: 0.5rem;
806
+ font-size: 0.9rem;
807
+ }
808
+ .attribution-reasoning {
809
+ color: var(--text-secondary);
810
+ margin-top: 1rem;
811
+ font-size: 0.9rem;
812
+ line-height: 1.4;
813
+ }
814
  /* Download Actions */
815
  .download-actions {
816
  display: flex;
 
975
  }
976
  .metrics-carousel-content {
977
  flex: 1;
 
978
  padding: 0;
 
979
  display: flex;
980
  align-items: flex-start;
981
  justify-content: flex-start;
982
  overflow-y: auto;
 
983
  padding: 1rem;
 
984
  }
985
  .metric-slide {
986
  display: none;
987
  width: 100%;
 
988
  padding: 1rem;
989
  }
990
  .metric-slide.active {
 
1022
  color: var(--text-secondary);
1023
  font-weight: 600;
1024
  }
1025
+ /* Info Card Text Styles */
1026
+ .verdict-text {
1027
+ font-size: 1.2rem !important;
1028
+ }
1029
+ .domain-text {
1030
+ font-size: 1.1rem !important;
1031
+ }
1032
+
1033
+ .verdict-mixed {
1034
+ background: rgba(168, 85, 247, 0.2);
1035
+ color: #a855f7;
1036
+ border: 1px solid rgba(168, 85, 247, 0.3);
1037
+ }
1038
+
1039
+ /* Reasoning Bullet Points */
1040
+ .reasoning-bullet-points {
1041
+ margin: 1.5rem 0;
1042
+ line-height: 1.6;
1043
+ text-align: left;
1044
+ }
1045
+
1046
+ .bullet-point {
1047
+ margin-bottom: 0.75rem;
1048
+ padding-left: 0.5rem;
1049
+ color: var(--text-secondary);
1050
+ font-size: 0.95rem;
1051
+ text-align: left;
1052
+ }
1053
+
1054
+ .bullet-point:last-child {
1055
+ margin-bottom: 0;
1056
+ }
1057
+
1058
+ .bullet-point strong {
1059
+ color: var(--text-primary);
1060
+ }
1061
+
1062
  /* Responsive */
1063
  @media (max-width: 1200px) {
1064
  .interface-grid {
 
1270
  id="text-input"
1271
  class="text-input"
1272
  placeholder="Paste your text here for analysis...
1273
+ The more text you provide (minimum 50 characters), the more accurate the detection will be."
1274
  ></textarea>
1275
  </div>
1276
  <div id="upload-tab" class="tab-content">
 
1399
  let currentAnalysisData = null;
1400
  let currentMetricIndex = 0;
1401
  let totalMetrics = 0;
1402
+
1403
  // Navigation
1404
  function showLanding() {
1405
  document.getElementById('landing-page').style.display = 'block';
1406
  document.getElementById('analysis-interface').style.display = 'none';
1407
  window.scrollTo(0, 0);
1408
  }
1409
+
1410
  function showAnalysis() {
1411
  document.getElementById('landing-page').style.display = 'none';
1412
  document.getElementById('analysis-interface').style.display = 'block';
1413
  window.scrollTo(0, 0);
1414
  resetAnalysisInterface();
1415
  }
1416
+
1417
  // Reset analysis interface
1418
  function resetAnalysisInterface() {
1419
  // Clear text input
 
1470
  currentMetricIndex = 0;
1471
  totalMetrics = 0;
1472
  }
1473
+
1474
  // Input Tab Switching
1475
  document.querySelectorAll('.input-tab').forEach(tab => {
1476
  tab.addEventListener('click', () => {
 
1483
  document.getElementById(`${tabName}-tab`).classList.add('active');
1484
  });
1485
  });
1486
+
1487
  // Report Tab Switching
1488
  document.querySelectorAll('.report-tab').forEach(tab => {
1489
  tab.addEventListener('click', () => {
 
1496
  document.getElementById(`${reportName}-report`).classList.add('active');
1497
  });
1498
  });
1499
+
1500
  // File Upload Handling
1501
  const fileInput = document.getElementById('file-input');
1502
  const fileUploadArea = document.getElementById('file-upload-area');
1503
  const fileNameDisplay = document.getElementById('file-name-display');
1504
+
1505
  fileUploadArea.addEventListener('click', () => {
1506
  fileInput.click();
1507
  });
1508
+
1509
  fileInput.addEventListener('change', (e) => {
1510
  handleFileSelect(e.target.files[0]);
1511
  });
1512
+
1513
  // Drag and Drop
1514
  fileUploadArea.addEventListener('dragover', (e) => {
1515
  e.preventDefault();
1516
  fileUploadArea.classList.add('drag-over');
1517
  });
1518
+
1519
  fileUploadArea.addEventListener('dragleave', () => {
1520
  fileUploadArea.classList.remove('drag-over');
1521
  });
1522
+
1523
  fileUploadArea.addEventListener('drop', (e) => {
1524
  e.preventDefault();
1525
  fileUploadArea.classList.remove('drag-over');
 
1529
  handleFileSelect(file);
1530
  }
1531
  });
1532
+
1533
  function handleFileSelect(file) {
1534
  if (!file) return;
1535
  const allowedTypes = ['.txt', '.pdf', '.docx', '.doc', '.md'];
 
1548
  <span style="color: var(--text-muted);">(${formatFileSize(file.size)})</span>
1549
  `;
1550
  }
1551
+
1552
  function formatFileSize(bytes) {
1553
  if (bytes < 1024) return bytes + ' B';
1554
  if (bytes < 1024 * 1024) return (bytes / 1024).toFixed(1) + ' KB';
1555
  return (bytes / (1024 * 1024)).toFixed(1) + ' MB';
1556
  }
1557
+
1558
  // Analyze Button
1559
  document.getElementById('analyze-btn').addEventListener('click', async () => {
1560
  const activeTab = document.querySelector('.input-tab.active').dataset.tab;
1561
  const textInput = document.getElementById('text-input').value.trim();
1562
  const fileInput = document.getElementById('file-input').files[0];
1563
+
1564
  if (activeTab === 'paste' && !textInput) {
1565
  alert('Please paste some text to analyze (minimum 50 characters).');
1566
  return;
 
1573
  alert('Please select a file to upload.');
1574
  return;
1575
  }
1576
+
1577
  await performAnalysis(activeTab, textInput, fileInput);
1578
  });
1579
+
1580
  // Refresh Button - clears everything and shows empty state
1581
  document.getElementById('refresh-btn').addEventListener('click', () => {
1582
  resetAnalysisInterface();
1583
  });
1584
+
1585
  // Try Next Button - same as refresh but keeps the interface ready
1586
  document.getElementById('try-next-btn').addEventListener('click', () => {
1587
  resetAnalysisInterface();
1588
  });
1589
+
1590
  async function performAnalysis(mode, text, file) {
1591
  const analyzeBtn = document.getElementById('analyze-btn');
1592
  analyzeBtn.disabled = true;
1593
  analyzeBtn.innerHTML = '⏳ Analyzing...';
1594
  showLoading();
1595
+
1596
  try {
1597
  let response;
1598
  if (mode === 'paste') {
 
1610
  analyzeBtn.innerHTML = '🔍 Analyze Text';
1611
  }
1612
  }
1613
+
1614
  async function analyzeText(text) {
1615
  const domain = document.getElementById('domain-select').value || null;
1616
  const enableAttribution = document.getElementById('enable-attribution').checked;
1617
  const enableHighlighting = document.getElementById('enable-highlighting').checked;
1618
  const useSentenceLevel = document.getElementById('use-sentence-level').checked;
1619
  const includeMetricsSummary = document.getElementById('include-metrics-summary').checked;
1620
+
1621
  const response = await fetch(`${API_BASE}/api/analyze`, {
1622
  method: 'POST',
1623
  headers: { 'Content-Type': 'application/json' },
 
1631
  skip_expensive_metrics: false
1632
  })
1633
  });
1634
+
1635
  if (!response.ok) {
1636
  const error = await response.json();
1637
  throw new Error(error.error || 'Analysis failed');
1638
  }
1639
  return await response.json();
1640
  }
1641
+
1642
  async function analyzeFile(file) {
1643
  const domain = document.getElementById('domain-select').value || null;
1644
  const enableAttribution = document.getElementById('enable-attribution').checked;
1645
  const useSentenceLevel = document.getElementById('use-sentence-level').checked;
1646
  const includeMetricsSummary = document.getElementById('include-metrics-summary').checked;
1647
+
1648
  const formData = new FormData();
1649
  formData.append('file', file);
1650
  if (domain) formData.append('domain', domain);
 
1652
  formData.append('use_sentence_level', useSentenceLevel.toString());
1653
  formData.append('include_metrics_summary', includeMetricsSummary.toString());
1654
  formData.append('skip_expensive_metrics', 'false');
1655
+
1656
  const response = await fetch(`${API_BASE}/api/analyze/file`, {
1657
  method: 'POST',
1658
  body: formData
1659
  });
1660
+
1661
  if (!response.ok) {
1662
  const error = await response.json();
1663
  throw new Error(error.error || 'File analysis failed');
1664
  }
1665
  return await response.json();
1666
  }
+
  function showLoading() {
  document.getElementById('summary-report').innerHTML = `
  <div class="loading">

  </div>
  `;
  }
+
  function showError(message) {
  document.getElementById('summary-report').innerHTML = `
  <div class="empty-state">

  </div>
  `;
  }
+
  function displayResults(data) {
  console.log('Response data:', data);
  // Handle different response structures

  console.error('Full response:', data);
  return;
  }
+
  // Extract data based on the actual API response structure
  const ensemble = detection.ensemble_result || detection.ensemble;
  const prediction = detection.prediction || {};
  const metrics = detection.metric_results || detection.metrics;
  const analysis = detection.analysis || {};
+
  // Display Summary with enhanced reasoning
  displaySummary(ensemble, prediction, analysis, data.attribution, data.reasoning);
+
  // Display Highlighted Text with enhanced features
  if (data.highlighted_html) {
  displayHighlightedText(data.highlighted_html);

  </div>
  `;
  }
+
  // Display Metrics with carousel
  if (metrics && Object.keys(metrics).length > 0) {
  displayMetricsCarousel(metrics, analysis, ensemble);

  `;
  }
  }
+
  function displaySummary(ensemble, prediction, analysis, attribution, reasoning) {
+ // Extract and validate data with fallbacks
+ const {
+ aiProbability,
+ humanProbability,
+ mixedProbability,
+ verdict,
+ confidence,
+ domain,
+ isAI,
+ gaugeColor,
+ gaugeDegree,
+ confidenceLevel,
+ confidenceClass
+ } = extractSummaryData(ensemble, analysis);
+
+ // Generate attribution HTML with proper filtering
+ const attributionHTML = generateAttributionHTML(attribution);
+
+ document.getElementById('summary-report').innerHTML = `
+ <div class="result-summary">
+ ${createGaugeSection(aiProbability, humanProbability, mixedProbability, gaugeColor, gaugeDegree)}
+ ${createInfoGrid(verdict, confidence, confidenceClass, domain, mixedProbability)}
+ ${createEnhancedReasoningHTML(ensemble, analysis, reasoning)}
+ ${attributionHTML}
+ ${createDownloadActions()}
+ </div>
+ `;
+ }
+
+ // Helper function to extract and validate summary data
+ function extractSummaryData(ensemble, analysis) {
  const aiProbability = ensemble.ai_probability !== undefined ?
  (ensemble.ai_probability * 100).toFixed(0) : '0';
+
+ const humanProbability = ensemble.human_probability !== undefined ?
+ (ensemble.human_probability * 100).toFixed(0) : '0';
+
+ const mixedProbability = ensemble.mixed_probability !== undefined ?
+ (ensemble.mixed_probability * 100).toFixed(0) : '0';
+
  const verdict = ensemble.final_verdict || 'Unknown';
  const confidence = ensemble.overall_confidence !== undefined ?
  (ensemble.overall_confidence * 100).toFixed(1) : '0';
+
+ // Fix: `domain` was returned below but never defined in this scope. It is
+ // assumed to arrive on the analysis payload; 'general' is a fallback
+ // default so formatDomainName() never receives undefined.
+ const domain = (analysis && analysis.domain) || 'general';

  const isAI = verdict.toLowerCase().includes('ai');
  const gaugeColor = isAI ? 'var(--danger)' : 'var(--success)';
  const gaugeDegree = aiProbability * 3.6;
+
+ const confidenceLevel = getConfidenceLevel(parseFloat(confidence));
+ const confidenceClass = getConfidenceClass(confidenceLevel);
+
+ return {
+ aiProbability,
+ humanProbability,
+ mixedProbability,
+ verdict,
+ confidence,
+ domain,
+ isAI,
+ gaugeColor,
+ gaugeDegree,
+ confidenceLevel,
+ confidenceClass
+ };
+ }
+
+ // Helper function to determine confidence level
+ function getConfidenceLevel(confidence) {
+ if (confidence >= 70) return 'HIGH';
+ if (confidence >= 40) return 'MEDIUM';
+ return 'LOW';
+ }
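+ // NOTE: the 70/40 cut-offs above are on the 0-100 scale; the same
+ // thresholds appear as 0.7/0.4 in formatSummaryAsBulletPoints(), so the
+ // two should be kept in sync if either is tuned.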
+
+ // Helper function to get confidence CSS class
+ function getConfidenceClass(confidenceLevel) {
+ const classMap = {
+ 'HIGH': 'confidence-high',
+ 'MEDIUM': 'confidence-medium',
+ 'LOW': 'confidence-low'
+ };
+ return classMap[confidenceLevel] || 'confidence-low';
+ }
+
+ // Helper function to generate attribution HTML with filtering
+ function generateAttributionHTML(attribution) {
+ if (!attribution || !attribution.predicted_model) {
+ return '';
+ }
+
+ const modelName = formatModelName(attribution.predicted_model);
+ const modelConf = attribution.confidence ?
+ (attribution.confidence * 100).toFixed(1) : 'N/A';
+
+ const topModelsHTML = generateTopModelsHTML(attribution.model_probabilities);
+ const reasoningHTML = generateAttributionReasoningHTML(attribution.reasoning);
+
+ // Only show attribution if confidence is meaningful (>30%)
+ if (attribution.confidence > 0.3) {
+ return `
  <div class="attribution-section">
  <div class="attribution-title">🤖 AI Model Attribution</div>
+ ${topModelsHTML}
+ <div class="attribution-confidence">
+ Attribution Confidence: <strong>${modelConf}%</strong>
+ </div>
+ ${reasoningHTML}
  </div>
  `;
  }
+
+ return '';
+ }
+
+ // Helper function to generate top models HTML with filtering
+ function generateTopModelsHTML(modelProbabilities) {
+ if (!modelProbabilities) {
+ return '<div class="attribution-uncertain">Model probabilities not available</div>';
+ }
+
+ // Filter and sort models
+ const meaningfulModels = Object.entries(modelProbabilities)
+ .sort((a, b) => b[1] - a[1])
+ .filter(([model, prob]) => prob > 0.15) // Only show models with >15% probability
+ .slice(0, 3); // Show top 3
+
+ if (meaningfulModels.length === 0) {
+ return `
+ <div class="attribution-uncertain">
+ Model attribution uncertain - text patterns don't strongly match any specific AI model
  </div>
+ `;
+ }
+
+ return meaningfulModels.map(([model, prob]) =>
+ `<div class="model-match">
+ <span class="model-name">${formatModelName(model)}</span>
+ <span class="model-confidence">${(prob * 100).toFixed(1)}%</span>
+ </div>`
+ ).join('');
+ }
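+ // Expected shape of model_probabilities (illustrative only, not a schema):
+ // { "gpt_4": 0.62, "claude": 0.21, "llama": 0.09 }, i.e. model id mapped
+ // to a probability in [0, 1].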
+
+ // Helper function to format model names
+ function formatModelName(modelName) {
+ return modelName.replace(/_/g, ' ').replace(/-/g, ' ').toUpperCase();
+ }
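+ // e.g. formatModelName('gpt_4-turbo') -> 'GPT 4 TURBO'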
+
+ // Helper function to generate attribution reasoning HTML
+ function generateAttributionReasoningHTML(reasoning) {
+ if (!reasoning || !Array.isArray(reasoning) || reasoning.length === 0) {
+ return '';
+ }
+
+ return `
+ <div class="attribution-reasoning">
+ ${reasoning[0]}
+ </div>
+ `;
+ }
+
+ // Helper function to create single-progress gauge section
+ function createGaugeSection(aiProbability, humanProbability, mixedProbability, gaugeColor, gaugeDegree) {
+ // Fix: coerce to numbers first. Callers pass .toFixed() strings, and '>='
+ // between two strings compares lexicographically ('9' >= '80' is true).
+ aiProbability = parseFloat(aiProbability);
+ humanProbability = parseFloat(humanProbability);
+ mixedProbability = parseFloat(mixedProbability);
+
+ // Determine which probability is highest
+ let maxValue, maxColor, maxLabel;
+
+ if (aiProbability >= humanProbability && aiProbability >= mixedProbability) {
+ maxValue = aiProbability;
+ maxColor = 'var(--danger)';
+ maxLabel = 'AI Probability';
+ } else if (humanProbability >= aiProbability && humanProbability >= mixedProbability) {
+ maxValue = humanProbability;
+ maxColor = 'var(--success)';
+ maxLabel = 'Human Probability';
+ } else {
+ maxValue = mixedProbability;
+ maxColor = 'var(--primary)';
+ maxLabel = 'Mixed Probability';
+ }
+
+ // Calculate the degree for the progress (maxValue% of 360 degrees)
+ const progressDegree = (maxValue / 100) * 360;
+
+ return `
+ <div class="gauge-container">
+ <div class="single-progress-gauge" style="
+ background: conic-gradient(
+ ${maxColor} 0deg,
+ ${maxColor} ${progressDegree}deg,
+ rgba(51, 65, 85, 0.3) ${progressDegree}deg,
+ rgba(51, 65, 85, 0.3) 360deg
+ );
+ ">
+ <div class="gauge-inner">
+ <div class="gauge-value" style="color: ${maxColor}">${maxValue}%</div>
+ <div class="gauge-label">${maxLabel}</div>
  </div>
+ </div>
+ </div>
+ <div style="display: grid; grid-template-columns: 1fr 1fr 1fr; gap: 1rem; margin: 1.5rem 0;">
+ <div style="text-align: center; padding: 1rem; background: rgba(239, 68, 68, 0.1); border-radius: 8px; border: 1px solid rgba(239, 68, 68, 0.3);">
+ <div style="font-size: 0.85rem; color: var(--danger); margin-bottom: 0.25rem; font-weight: 600;">AI</div>
+ <div style="font-size: 1.4rem; font-weight: 700; color: var(--danger);">${aiProbability}%</div>
+ </div>
+ <div style="text-align: center; padding: 1rem; background: rgba(16, 185, 129, 0.1); border-radius: 8px; border: 1px solid rgba(16, 185, 129, 0.3);">
+ <div style="font-size: 0.85rem; color: var(--success); margin-bottom: 0.25rem; font-weight: 600;">Human</div>
+ <div style="font-size: 1.4rem; font-weight: 700; color: var(--success);">${humanProbability}%</div>
+ </div>
+ <div style="text-align: center; padding: 1rem; background: rgba(6, 182, 212, 0.1); border-radius: 8px; border: 1px solid rgba(6, 182, 212, 0.3);">
+ <div style="font-size: 0.85rem; color: var(--primary); margin-bottom: 0.25rem; font-weight: 600;">Mixed</div>
+ <div style="font-size: 1.4rem; font-weight: 700; color: var(--primary);">${mixedProbability}%</div>
+ </div>
+ </div>
+ <style>
+ .single-progress-gauge {
+ width: 220px;
+ height: 220px;
+ margin: 0 auto 2rem;
+ position: relative;
+ border-radius: 50%;
+ box-shadow: 0 4px 20px rgba(0, 0, 0, 0.3);
+ }
+
+ .gauge-inner {
+ position: absolute;
+ width: 170px;
+ height: 170px;
+ background: var(--bg-panel);
+ border-radius: 50%;
+ top: 50%;
+ left: 50%;
+ transform: translate(-50%, -50%);
+ display: flex;
+ flex-direction: column;
+ align-items: center;
+ justify-content: center;
+ }
+
+ .gauge-value {
+ font-size: 3rem;
+ font-weight: 800;
+ }
+
+ .gauge-label {
+ font-size: 0.9rem;
+ color: var(--text-secondary);
+ margin-top: 0.25rem;
+ }
+ </style>
+ `;
+ }
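+ // The gauge above is pure CSS: the conic-gradient fills maxColor from 0deg
+ // to progressDegree with a neutral track for the remainder, and the
+ // .gauge-inner overlay masks the centre so the disc reads as a ring.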
+
+ // Helper function to create info grid
+ function createInfoGrid(verdict, confidence, confidenceClass, domain, mixedProbability) {
+ const mixedContentInfo = mixedProbability > 10 ?
+ `<div style="margin-top: 0.5rem; font-size: 0.85rem; color: var(--primary);">
+ 🔀 ${mixedProbability}% Mixed Content Detected
+ </div>` : '';
+
+ return `
+ <div class="result-info-grid">
+ <div class="info-card">
+ <div class="info-label">Verdict</div>
+ <div class="info-value verdict-text">${verdict}</div>
+ ${mixedContentInfo}
+ </div>
+ <div class="info-card">
+ <div class="info-label">Confidence Level</div>
+ <div class="info-value">
+ <span class="confidence-badge ${confidenceClass}">${confidence}%</span>
  </div>
  </div>
+ <div class="info-card">
+ <div class="info-label">Content Domain</div>
+ <div class="info-value domain-text">${formatDomainName(domain)}</div>
  </div>
  </div>
  `;
  }
+
+ // Helper function to create download actions
+ function createDownloadActions() {
+ return `
+ <div class="download-actions">
+ <button class="download-btn" onclick="downloadReport('json')">
+ 📄 Download JSON
+ </button>
+ <button class="download-btn" onclick="downloadReport('pdf')">
+ 📑 Download PDF Report
+ </button>
+ </div>
+ `;
+ }
+
+ // Helper function to format domain names
+ function formatDomainName(domain) {
+ return domain.split('_').map(w => w.charAt(0).toUpperCase() + w.slice(1)).join(' ');
+ }
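+ // e.g. formatDomainName('social_media') -> 'Social Media'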
+
  function createEnhancedReasoningHTML(ensemble, analysis, reasoning) {
+ // Use reasoning data if available
  if (reasoning && reasoning.summary) {
+ // Process the summary into bullet points
+ const bulletPoints = formatSummaryAsBulletPoints(reasoning.summary, ensemble, analysis);
+
+ // Process key indicators with markdown formatting
+ let processedIndicators = [];
+ if (reasoning.key_indicators && reasoning.key_indicators.length > 0) {
+ processedIndicators = reasoning.key_indicators.map(indicator => {
+ let processedIndicator = indicator;
+
+ // Remove HTML entities
+ processedIndicator = processedIndicator.replace(/&ast;/g, '*')
+ .replace(/&#42;/g, '*');
+
+ // Process bold formatting
+ processedIndicator = processedIndicator.replace(/\*\*([^*]+)\*\*/g, '<strong>$1</strong>')
+ .replace(/\*([^*]+)\*/g, '<strong>$1</strong>');
+
+ // Clean up remaining asterisks
+ processedIndicator = processedIndicator.replace(/\*\*/g, '')
+ .replace(/\*(?![^<]*>)/g, '');
+
+ // Replace underscores with spaces
+ processedIndicator = processedIndicator.replace(/_/g, ' ');
+
+ return processedIndicator;
+ });
+ }
+
  return `
  <div class="reasoning-box enhanced">
  <div class="reasoning-header">

  <div class="verdict-text">${ensemble.final_verdict}</div>
  <div class="probability">AI Probability: <span class="probability-value">${(ensemble.ai_probability * 100).toFixed(2)}%</span></div>
  </div>
+ <div class="reasoning-bullet-points">
+ ${bulletPoints}
  </div>
+ ${processedIndicators.length > 0 ? `
  <div class="metrics-breakdown">
+ <div class="breakdown-header" style="text-align: center; font-weight: 700; color: var(--text-secondary); margin-bottom: 1rem;">
+ KEY INDICATORS
+ </div>
+ ${processedIndicators.map(indicator => {
+ // Split indicator into metric name and sub-metric details
+ const colonIndex = indicator.indexOf(':');
+ if (colonIndex !== -1) {
+ const metricName = indicator.substring(0, colonIndex).trim();
+ const metricDetails = indicator.substring(colonIndex + 1).trim();
+
+ return `
+ <div style="margin-bottom: 1rem; text-align: left;">
+ <div style="font-weight: 700; color: #fff; text-align: center; margin-bottom: 0.5rem; font-size: 1rem;">
+ ${metricName}
+ </div>
+ <div style="color: var(--text-secondary); font-size: 0.9rem; line-height: 1.4; text-align: left;">
+ ${metricDetails}
+ </div>
  </div>
+ `;
+ } else {
+ // If no colon, treat as general indicator
+ return `
+ <div style="margin-bottom: 1rem; text-align: left;">
+ <div style="color: var(--text-secondary); font-size: 0.9rem; line-height: 1.4;">
+ ${indicator}
+ </div>
+ </div>
+ `;
+ }
  }).join('')}
  </div>
  ` : ''}

  return `
  <div class="reasoning-box">
  <div class="reasoning-title">💡 Detection Reasoning</div>
+ <p class="reasoning-text" style="text-align: left;">
  Analysis based on 6-metric ensemble with domain-aware calibration.
  The system evaluated linguistic patterns, statistical features, and semantic structures
  to determine content authenticity with ${(ensemble.overall_confidence * 100).toFixed(1)}% confidence.

  </div>
  `;
  }
+
+ // Helper function to format summary as bullet points
+ function formatSummaryAsBulletPoints(summary, ensemble, analysis) {
+ let processedSummary = summary;
+
+ // Remove any existing HTML entities for asterisks first
+ processedSummary = processedSummary.replace(/&ast;/g, '*')
+ .replace(/&#42;/g, '*');
+
+ // Process markdown bold formatting
+ processedSummary = processedSummary.replace(/\*\*([^*]+)\*\*/g, '<strong>$1</strong>')
+ .replace(/\*([^*]+)\*/g, '<strong>$1</strong>');
+
+ // Final cleanup: remove any remaining standalone asterisks that weren't processed
+ processedSummary = processedSummary.replace(/\*\*/g, '')
+ .replace(/\*(?![^<]*>)/g, '');
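+ // The negative lookahead above keeps an asterisk only when a '>' lies
+ // ahead with no '<' in between (i.e. it appears to sit inside an HTML
+ // tag); stray asterisks left over from unpaired markdown are dropped.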
+
+ // Split the summary into sentences/phrases for bullet points
+ const sentences = processedSummary.split(/\.\s+/);
+
+ // Create bullet points from key information
+ const bulletPoints = [];
+
+ // Add confidence level as first bullet
+ const confidenceLevel = ensemble.overall_confidence >= 0.7 ? 'High Confidence' :
+ ensemble.overall_confidence >= 0.4 ? 'Medium Confidence' : 'Low Confidence';
+ bulletPoints.push(`<div class="bullet-point">• ${confidenceLevel}</div>`);
+
+ // Add verdict as second bullet
+ bulletPoints.push(`<div class="bullet-point">• ${ensemble.final_verdict}</div>`);
+
+ // Add AI probability as third bullet
+ bulletPoints.push(`<div class="bullet-point">• AI Probability: ${(ensemble.ai_probability * 100).toFixed(2)}%</div>`);
+
+ // Add the main analysis sentences as individual bullets
+ sentences.forEach(sentence => {
+ if (sentence.trim() &&
+ !sentence.includes('confidence') &&
+ !sentence.includes(ensemble.final_verdict) &&
+ !sentence.includes('AI probability')) {
+ // Clean up the sentence and add as bullet
+ let cleanSentence = sentence.trim();
+ if (!cleanSentence.endsWith('.')) {
+ cleanSentence += '.';
+ }
+ bulletPoints.push(`<div class="bullet-point">• ${cleanSentence}</div>`);
+ }
+ });
+
+ return bulletPoints.join('');
+ }
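+ // Note: the de-duplication filter above uses case-sensitive includes(),
+ // so summary sentences mentioning 'Confidence' or 'AI Probability' with
+ // different casing will still come through as bullets.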
+
  function displayHighlightedText(html) {
  document.getElementById('highlighted-report').innerHTML = `
  ${createDefaultLegend()}

  ${getHighlightStyles()}
  `;
  }
+
  function createDefaultLegend() {
  return `
  <div class="highlight-legend">

  </div>
  `;
  }
+
  function getHighlightStyles() {
  return `
  <style>

  </style>
  `;
  }
+
  function displayMetricsCarousel(metrics, analysis, ensemble) {
  const metricOrder = ['structural', 'perplexity', 'entropy', 'semantic_analysis', 'linguistic', 'multi_perturbation_stability'];
  const availableMetrics = metricOrder.filter(key => metrics[key]);
  totalMetrics = availableMetrics.length;
+
  if (totalMetrics === 0) {
  document.getElementById('metrics-report').innerHTML = `
  <div class="empty-state">

  `;
  return;
  }
+
  let carouselHTML = `
  <div class="metrics-carousel-container">
  <div class="metrics-carousel-content">
  `;
+
  availableMetrics.forEach((metricKey, index) => {
  const metric = metrics[metricKey];
  if (!metric) return;
+
  const aiProb = (metric.ai_probability * 100).toFixed(1);
  const humanProb = (metric.human_probability * 100).toFixed(1);
+ // Guard: not every metric reports mixed_probability; default to 0 so the
+ // bar shows '0.0' instead of 'NaN'.
+ const mixedProb = ((metric.mixed_probability || 0) * 100).toFixed(1);
  const confidence = (metric.confidence * 100).toFixed(1);
  const weight = ensemble.metric_contributions && ensemble.metric_contributions[metricKey] ?
+ (ensemble.metric_contributions[metricKey].weight * 100).toFixed(1) : '0.0';
+
+ // Determine verdict based on probabilities
+ let verdictText, verdictClass;
+ if (metric.mixed_probability > 0.3) {
+ verdictText = 'MIXED';
+ verdictClass = 'verdict-mixed';
+ } else if (metric.ai_probability >= 0.6) {
+ verdictText = 'AI';
+ verdictClass = 'verdict-ai';
+ } else if (metric.ai_probability >= 0.4) {
+ verdictText = 'UNCERTAIN';
+ verdictClass = 'verdict-uncertain';
+ } else {
+ verdictText = 'HUMAN';
+ verdictClass = 'verdict-human';
+ }
+
  carouselHTML += `
  <div class="metric-slide ${index === 0 ? 'active' : ''}" data-metric-index="${index}">
  <div class="metric-result-card">

  <div class="metric-description">
  ${getMetricDescription(metricKey)}
  </div>
+
+ <!-- Enhanced Probability Display with Mixed -->
+ <div style="display: grid; grid-template-columns: 1fr 1fr 1fr; gap: 1rem; margin: 1rem 0;">
+ <div style="text-align: center;">
  <div style="font-size: 0.75rem; color: var(--text-muted); margin-bottom: 0.25rem;">AI</div>
  <div style="background: rgba(51, 65, 85, 0.5); height: 8px; border-radius: 4px; overflow: hidden;">
  <div style="background: var(--danger); height: 100%; width: ${aiProb}%; transition: width 0.5s;"></div>
  </div>
  <div style="font-size: 0.85rem; font-weight: 600; margin-top: 0.25rem;">${aiProb}%</div>
  </div>
+ <div style="text-align: center;">
  <div style="font-size: 0.75rem; color: var(--text-muted); margin-bottom: 0.25rem;">Human</div>
  <div style="background: rgba(51, 65, 85, 0.5); height: 8px; border-radius: 4px; overflow: hidden;">
  <div style="background: var(--success); height: 100%; width: ${humanProb}%; transition: width 0.5s;"></div>
  </div>
  <div style="font-size: 0.85rem; font-weight: 600; margin-top: 0.25rem;">${humanProb}%</div>
  </div>
+ <div style="text-align: center;">
+ <div style="font-size: 0.75rem; color: var(--text-muted); margin-bottom: 0.25rem;">Mixed</div>
+ <div style="background: rgba(51, 65, 85, 0.5); height: 8px; border-radius: 4px; overflow: hidden;">
+ <div style="background: var(--primary); height: 100%; width: ${mixedProb}%; transition: width 0.5s;"></div>
+ </div>
+ <div style="font-size: 0.85rem; font-weight: 600; margin-top: 0.25rem;">${mixedProb}%</div>
+ </div>
  </div>
+
  <div style="display: flex; justify-content: space-between; align-items: center; margin: 0.75rem 0;">
  <span class="metric-verdict ${verdictClass}">${verdictText}</span>
  <span style="font-size: 0.85rem; color: var(--text-secondary);">Confidence: ${confidence}% | Weight: ${weight}%</span>

  </div>
  `;
  });
+
  carouselHTML += `
  </div>
  <div class="metrics-carousel-nav">

  </div>
  </div>
  `;
+
  document.getElementById('metrics-report').innerHTML = carouselHTML;
  updateCarouselButtons();
  }
+
  function navigateMetrics(direction) {
  const newMetricIndex = currentMetricIndex + direction;
  if (newMetricIndex >= 0 && newMetricIndex < totalMetrics) {

  updateMetricCarousel();
  }
  }
+
  function updateMetricCarousel() {
  const slides = document.querySelectorAll('.metric-slide');
  slides.forEach((slide, index) => {

  positionElement.textContent = `${currentMetricIndex + 1} / ${totalMetrics}`;
  }
  }
+
  function updateCarouselButtons() {
  const prevBtn = document.querySelector('.prev-btn');
  const nextBtn = document.querySelector('.next-btn');

  nextBtn.disabled = currentMetricIndex === totalMetrics - 1;
  }
  }
+
  function renderMetricDetails(metricName, details) {
  if (!details || Object.keys(details).length === 0) return '';
+
  // Key metrics to show for each type
  const importantKeys = {
  'structural': ['burstiness_score', 'length_uniformity', 'avg_sentence_length', 'std_sentence_length'],

  'entropy': ['token_diversity', 'sequence_unpredictability', 'char_entropy'],
  'semantic_analysis': ['coherence_score', 'consistency_score', 'repetition_score'],
  'linguistic': ['pos_diversity', 'syntactic_complexity', 'grammatical_consistency'],
+ 'multi_perturbation_stability': ['stability_score', 'curvature_score', 'likelihood_ratio', 'perturbation_variance', 'mixed_probability']
  };
+
  const keysToShow = importantKeys[metricName] || Object.keys(details).slice(0, 6);
+
  let detailsHTML = '<div style="margin-top: 1rem; padding-top: 1rem; border-top: 1px solid var(--border);">';
  detailsHTML += '<div style="font-size: 0.9rem; font-weight: 600; color: var(--text-secondary); margin-bottom: 0.75rem;">📈 Detailed Metrics:</div>';
  detailsHTML += '<div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(180px, 1fr)); gap: 0.75rem; font-size: 0.85rem;">';
+
  keysToShow.forEach(key => {
  if (details[key] !== undefined && details[key] !== null) {
+ let value = details[key];
+ let displayValue;
+
+ // Format values appropriately
+ if (typeof value === 'number') {
+ if (key.includes('score') || key.includes('ratio') || key.includes('probability')) {
+ displayValue = (value * 100).toFixed(2) + '%';
+ } else if (value < 1 && value > 0) {
+ displayValue = value.toFixed(4);
+ } else {
+ displayValue = value.toFixed(2);
+ }
+ } else {
+ displayValue = value;
+ }
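+ // e.g. a 'coherence_score' of 0.8732 renders as '87.32%', a small raw
+ // value such as 0.0042 as '0.0042', and 12.5 as '12.50'.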
+
  const label = key.replace(/_/g, ' ').replace(/\b\w/g, c => c.toUpperCase());
  detailsHTML += `
  <div style="background: rgba(15, 23, 42, 0.6); padding: 0.5rem; border-radius: 6px;">
  <div style="color: var(--text-muted); font-size: 0.75rem; margin-bottom: 0.25rem;">${label}</div>
+ <div style="color: var(--primary); font-weight: 700;">${displayValue}</div>
  </div>
  `;
  }
  });
+
  detailsHTML += '</div></div>';
  return detailsHTML;
  }
+
  function getMetricDescription(metricName) {
  const descriptions = {
  structural: 'Analyzes sentence structure, length patterns, and statistical features.',

  };
  return descriptions[metricName] || 'Metric analysis complete.';
  }
+
  function formatMetricName(name) {
  const names = {
  structural: 'Structural Analysis',

  };
  return names[name] || name.split('_').map(w => w.charAt(0).toUpperCase() + w.slice(1)).join(' ');
  }
+
  async function downloadReport(format) {
  if (!currentAnalysisData) {
  alert('No analysis data available');
  return;
  }
+
  try {
  const analysisId = currentAnalysisData.analysis_id;
  const timestamp = new Date().toISOString().replace(/[:.]/g, '-');
+
  // For JSON, download directly from current data
  if (format === 'json') {
  const data = {

  await downloadBlob(blob, filename);
  return;
  }
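+ // (JSON is assembled client-side above from currentAnalysisData; only the
+ // PDF path below needs a round-trip to the report-generation endpoint.)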
+
  // Get the original text for report generation
  const activeTab = document.querySelector('.input-tab.active').dataset.tab;
  let textToSend = '';

  textToSend = currentAnalysisData.detection_result?.processed_text?.text ||
  'Uploaded file content - see analysis for details';
  }
+
  // For PDF, request from server
  const formData = new FormData();
  formData.append('analysis_id', analysisId);
  formData.append('text', textToSend);
  formData.append('formats', format);
  formData.append('include_highlights', document.getElementById('enable-highlighting').checked.toString());
+
  const response = await fetch(`${API_BASE}/api/report/generate`, {
  method: 'POST',
  body: formData
  });
+
  if (!response.ok) {
  throw new Error('Report generation failed');
  }
+
  const result = await response.json();
  if (result.reports && result.reports[format]) {
  const filename = result.reports[format];

  alert('Failed to download report. Please try again.');
  }
  }
+
  async function downloadBlob(blob, filename) {
  try {
  const url = URL.createObjectURL(blob);

  alert('Download failed. Please try again.');
  }
  }
+
  function showDownloadSuccess(filename) {
  const notification = document.createElement('div');
  notification.style.cssText = `

  </div>
  `;
  document.body.appendChild(notification);
+
  if (!document.querySelector('#download-animation')) {
  const style = document.createElement('style');
  style.id = 'download-animation';

  `;
  document.head.appendChild(style);
  }
+
  setTimeout(() => {
  if (notification.parentNode) {
  notification.parentNode.removeChild(notification);
  }
  }, 3000);
  }
+
  // Smooth scrolling for anchor links
  document.querySelectorAll('a[href^="#"]').forEach(anchor => {
  anchor.addEventListener('click', function (e) {

  }
  });
  });
+
  // Initialize - show landing page by default
  showLanding();
  </script>