| # Data Card — OddsFlow Transparency Pack |
|
|
| **Purpose:** This data card describes what appears in public logs/schemas in this repository. |
| **Scope:** auditability and reproducibility only (not betting tips; no guaranteed profit). |
|
|
| ## 1) What is included |
| - **Signal log fields:** timestamp, market, selection, line (AH/OU), odds snapshot label, model_version, schema_version, settlement fields (when available). |
| - **Schemas:** JSON Schema definitions under `datasets/schema/`. |
| - **Samples:** small illustrative JSONL examples under `datasets/samples/`. |
|
|
| ## 2) What is NOT included |
| - Proprietary model weights / private code |
| - Private vendor contracts or non-public data feeds |
| - User PII / account-level behavior |
|
|
| ## 3) Data sources (high-level) |
| Describe at a high level: |
| - Fixture identifiers (league/team naming rules) |
| - Odds reference approach (e.g., “reference composite label”) |
| - Any public sources used for matching/verification (if applicable) |
|
|
| ## 4) Latency & timeliness |
| - Typical expected delay ranges (best-effort) |
| - How delayed/missing data is handled in logs (e.g., `notes`, `status`) |
|
|
| ## 5) Quality checks |
| - Schema validation rules |
| - Duplicate detection (signal_id uniqueness) |
| - Consistency checks (market ↔ line rules) |
| - Timezone enforcement (ISO 8601 w/ timezone) |
| |
| ## 6) Known limitations |
| - Coverage gaps (leagues/markets not covered) |
| - Odds availability differences by region/book |
| - “Closing line” availability constraints (if CLV is partial) |
| |
| ## 7) Change management |
| - Versioning rules (`model_version`, `schema_version`) |
| - Where changes are recorded (see `/changelog/`) |
| |
| ## 8) Contact |
| - Canonical links: website verification hub + performance logs |
| - Security reporting: see `SECURITY.md` |
| |