Канал: Efficient NLP

AI-generated text: Detection methods and countermeasures

AI-generated text: Detection methods and countermeasures

Residual Vector Quantization for Audio and Speech Embeddings

Residual Vector Quantization for Audio and Speech Embeddings

Introducing Voice Writer

Introducing Voice Writer

Can Whisper be used for real-time streaming ASR?

Can Whisper be used for real-time streaming ASR?

Top 10 most cited and influential papers in the history of NLP

Top 10 most cited and influential papers in the history of NLP

Basic facts about the Teochew language

Basic facts about the Teochew language

Fine-tuning Whisper to learn my Chinese dialect (Teochew)

Fine-tuning Whisper to learn my Chinese dialect (Teochew)

A better Hugging Face model search with OpenAI, RAG, pgvector

A better Hugging Face model search with OpenAI, RAG, pgvector

Speculative Decoding: When Two LLMs are Faster than One

Speculative Decoding: When Two LLMs are Faster than One

NLP Model Finder Tool

NLP Model Finder Tool

Fun Fact about Machine Translation

Fun Fact about Machine Translation

Exploring the 24 Areas of Natural Language Processing Research

Exploring the 24 Areas of Natural Language Processing Research

Rotary Positional Embeddings: Combining Absolute and Relative

Rotary Positional Embeddings: Combining Absolute and Relative

The KV Cache: Memory Usage in Transformers

The KV Cache: Memory Usage in Transformers

Quantization vs Pruning vs Distillation: Optimizing NNs for Inference

Quantization vs Pruning vs Distillation: Optimizing NNs for Inference

How is Beam Search Really Implemented?

How is Beam Search Really Implemented?

Non-Autoregressive and Shallow Decoding: Speeding up Translation

Non-Autoregressive and Shallow Decoding: Speeding up Translation

Which transformer architecture is best? Encoder-only vs Encoder-decoder vs Decoder-only models

Which transformer architecture is best? Encoder-only vs Encoder-decoder vs Decoder-only models