dcata004 commited on
Commit
e007722
·
verified ·
1 Parent(s): a5550d5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -3
README.md CHANGED
@@ -1,3 +1,27 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - text-classification
4
+ - recruitment
5
+ - forensics
6
+ - security
7
+ license: mit
8
+ datasets:
9
+ - dcata004/recruiter-harvesting-dataset-v1
10
+ pipeline_tag: text-classification
11
+ ---
12
+
13
+ # 🐍 V.I.P.E.R. Classification Engine (v1.0)
14
+ **Maintainer:** [Cata Risk Lab](https://huggingface.co/Cata-Risk-Lab)
15
+
16
+ ## 🧠 Model Overview
17
+ This repository contains the configuration and architecture definitions for the **V.I.P.E.R.** recruitment auditing system. It defines the risk thresholds and vectorization parameters used to detect "Resume Harvesting" attacks.
18
+
19
+ ## 🛠️ Configuration
20
+ The model operates on a `TfidfVectorizer` pipeline optimized for short-text classification of email subjects and bodies.
21
+
22
+ - **Risk Threshold:** 0.75 (Confidence score required to flag as SPAM)
23
+ - **Labels:** `['harvesting', 'legitimate']`
24
+ - **Dataset:** Trained on forensic recruitment data (Swiss/US/UK).
25
+
26
+ ## ⚖️ Sovereign AI
27
+ Designed for local inference to protect user data privacy.