Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

OpenMOSS-Team
/
MOSS-Audio-Tokenizer-v2

Image Feature Extraction
Transformers
Safetensors
moss-audio-tokenizer
audio
audio-tokenizer
neural-codec
moss-tts-family
MOSS Audio Tokenizer
speech-tokenizer
trust-remote-code
custom_code
Model card Files Files and versions
xet
Community

Instructions to use OpenMOSS-Team/MOSS-Audio-Tokenizer-v2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • Transformers

    How to use OpenMOSS-Team/MOSS-Audio-Tokenizer-v2 with Transformers:

    # Use a pipeline as a high-level helper
    from transformers import pipeline
    
    pipe = pipeline("image-feature-extraction", model="OpenMOSS-Team/MOSS-Audio-Tokenizer-v2", trust_remote_code=True)
    # Load model directly
    from transformers import AutoModel
    model = AutoModel.from_pretrained("OpenMOSS-Team/MOSS-Audio-Tokenizer-v2", trust_remote_code=True, dtype="auto")
  • Notebooks
  • Google Colab
  • Kaggle
MOSS-Audio-Tokenizer-v2
8.5 GB
Ctrl+K
Ctrl+K
  • 3 contributors
History: 20 commits
KuangWei Chen
Update MOSS Audio Tokenizer v2 weight dtype loading methods
fed8398 about 5 hours ago
  • demo
    add 3 days ago
  • images
    Upload MOSS Audio Tokenizer v2 about 6 hours ago
  • .gitattributes
    2.16 kB
    Upload MOSS Audio Tokenizer v2 about 6 hours ago
  • .gitignore
    36 Bytes
    Upload MOSS Audio Tokenizer v2 2 days ago
  • LICENSE
    11.3 kB
    add 3 days ago
  • README.md
    13 kB
    Update MOSS Audio Tokenizer v2 weight dtype loading methods about 5 hours ago
  • __init__.py
    52 Bytes
    update moss-audio-tokenizer-v2 2 months ago
  • config.json
    10.2 kB
    modify default attention backend to flash_attn 26 days ago
  • configuration_moss_audio_tokenizer.py
    19.8 kB
    Update MOSS Audio Tokenizer v2 weight dtype loading methods about 5 hours ago
  • model-00001-of-00003.safetensors
    3.98 GB
    xet
    update moss-audio-tokenizer-v2 2 months ago
  • model-00002-of-00003.safetensors
    3.99 GB
    xet
    update moss-audio-tokenizer-v2 2 months ago
  • model-00003-of-00003.safetensors
    524 MB
    xet
    update moss-audio-tokenizer-v2 2 months ago
  • model.safetensors.index.json
    192 kB
    update moss-audio-tokenizer-v2 2 months ago
  • modeling_moss_audio_tokenizer.py
    106 kB
    Update MOSS Audio Tokenizer v2 weight dtype loading methods about 5 hours ago