Question 1

How to train a metric learning model with OML on my own dataset?

Accepted Answer

Prepare your data in the required .csv format with columns for paths, labels, and categories. Then, choose a pretrained model from the zoo or define your own, modify the configuration YAML file, and run the pipeline which handles training, validation, and post-processing automatically.

Question 2

What's the difference between OML and PyTorch Metric Learning?

Accepted Answer

OML is more pipeline and recipe-oriented, providing config-based training and pretrained models for practical use cases like product retrieval, while PML is a toolkit of losses and miners without built-in pipelines. OML focuses on end-to-end solutions, whereas PML offers more flexibility for custom implementations.

Question 3

Can OML be used for text similarity tasks?

Accepted Answer

Yes, OML supports text data through integration with HuggingFace Transformers. You can use models like BERT as extractors, apply metric learning losses, and train embeddings for text retrieval, as shown in the example with mock text datasets.

Question 4

How do I deploy OML models in production?

Accepted Answer

Since OML lacks direct ONNX export, deployment typically involves using PyTorch for inference or manually converting models to ONNX. For retrieval systems, compute embeddings with trained extractors and integrate with vector databases like Qdrant or Faiss for efficient searching.

Question 5

What are the best pretrained models for image retrieval in OML?

Accepted Answer

OML's zoo includes models like ViTExtractor pretrained on benchmarks such as DeepFashion Inshop (cmc1 0.921) and Stanford Online Products (cmc1 0.866). Choose based on your domain; for general use, models like vits16_dino or unicom variants offer good starting points.

Question 6

Is OML good for audio speaker recognition?

Accepted Answer

Yes, OML includes audio support with models like ECAPATDNNExtractor, which achieves low EER on VoxCeleb benchmarks. You can train embeddings for speaker verification using the provided pipelines and pretrained checkpoints, though it requires installing the audio optional dependency.

OpenMetricLearning

What is OpenMetricLearning?

Overview

Use Cases

Best For

Related Projects

Found a gem we're missing?

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions