Official repository for Big Transfer (BiT) models, providing pre-trained visual representations for efficient transfer learning across computer vision tasks.
Big Transfer (BiT) is an open-source project from Google Research that provides pre-trained deep learning models for computer vision tasks. It focuses on transfer learning, allowing developers to fine-tune these models on custom datasets with minimal data and computational resources. The models are trained on large-scale datasets like ImageNet-21k to capture general visual representations that boost performance across various downstream applications.
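The fine-tuning workflow can be sketched as follows. This is a minimal, hypothetical illustration of transfer learning in PyTorch: a tiny stand-in network plays the role of a pre-trained BiT backbone (the real models are loaded from the repository's released weights), the backbone is frozen, and only a new task head is trained.

```python
import torch
import torch.nn as nn

# Stand-in backbone: in practice this would be a pre-trained BiT ResNet
# loaded from the repository's released checkpoints (hypothetical here).
backbone = nn.Sequential(
    nn.Conv2d(3, 8, 3, padding=1),
    nn.ReLU(),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
)

# Freeze the pre-trained weights; only the new task head is trained.
for p in backbone.parameters():
    p.requires_grad = False

num_classes = 10  # size of the custom dataset's label space
head = nn.Linear(8, num_classes)
model = nn.Sequential(backbone, head)

optimizer = torch.optim.SGD(head.parameters(), lr=0.003, momentum=0.9)
loss_fn = nn.CrossEntropyLoss()

# One fine-tuning step on a random placeholder batch.
x = torch.randn(4, 3, 32, 32)
y = torch.randint(0, num_classes, (4,))
optimizer.zero_grad()
loss = loss_fn(model(x), y)
loss.backward()
optimizer.step()
```

Because gradients flow only into the head, a fine-tuning run like this needs far less data and compute than training the full network from scratch.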
Machine learning researchers, computer vision engineers, and data scientists who need efficient, high-accuracy models for image classification, few-shot learning, or benchmark tasks like VTAB-1k. BiT is also suitable for educators and practitioners exploring transfer-learning techniques.
BiT offers state-of-the-art pre-trained models with multi-framework support (TensorFlow 2, PyTorch, and JAX/Flax), reducing training time and data requirements. Its emphasis on large-scale pre-training and knowledge distillation provides a strong balance of performance and efficiency, backed by rigorous research and extensive benchmarking.
Official repository for the "Big Transfer (BiT): General Visual Representation Learning" paper.
Models are pre-trained on ImageNet-21k, offering richer visual representations than standard ILSVRC-2012 models, which boosts transfer learning performance across diverse tasks.
Provides fine-tuning code and model formats for TensorFlow 2, PyTorch, and JAX/Flax, with separate installation and training scripts for each framework.
Includes multiple ResNet architectures (e.g., R50x1 to R152x4) to balance accuracy and speed, detailed in the available models section for tailored use cases.
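The variant names encode the architecture: in the RDxW convention, D is the ResNet depth and W is the channel-width multiplier. A small, stdlib-only helper (hypothetical, not part of the repository) makes the convention explicit:

```python
import re

def parse_bit_name(name):
    """Parse a BiT ResNet variant name like 'R50x1' into
    (depth, width_multiplier), following the RDxW naming convention:
    ResNet depth D with channel widths multiplied by W."""
    m = re.fullmatch(r"R(\d+)x(\d+)", name)
    if m is None:
        raise ValueError(f"not a BiT variant name: {name!r}")
    return int(m.group(1)), int(m.group(2))

# Variants released in the repository, roughly smallest to largest.
for variant in ["R50x1", "R50x3", "R101x1", "R101x3", "R152x4"]:
    depth, width = parse_bit_name(variant)
    print(f"{variant}: depth {depth}, width x{width}")
```

Larger depths and width multipliers trade speed and memory for accuracy, so the name alone tells you roughly where a variant sits on that curve.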
Offers distilled models like BiT-R50x1 that maintain high accuracy with reduced computational footprint, based on research from the linked distillation paper.
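Distillation trains a small student to match a large teacher's softened output distribution. The sketch below shows the standard temperature-scaled KL objective in NumPy; it is illustrative only, and the exact recipe in the linked distillation paper may differ (e.g. mixup, long training schedules).

```python
import numpy as np

def softmax(z, T=1.0):
    """Temperature-scaled, numerically stable softmax."""
    z = z / T
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, T=2.0):
    """KL divergence between the softened teacher and student
    distributions, scaled by T^2 as is conventional in distillation."""
    p = softmax(teacher_logits, T)  # teacher "soft labels"
    q = softmax(student_logits, T)  # student predictions
    kl = (p * (np.log(p) - np.log(q))).sum(axis=-1).mean()
    return float(kl * T * T)

teacher = np.array([[4.0, 1.0, 0.5]])
student = np.array([[3.5, 1.2, 0.4]])
print(distillation_loss(student, teacher))
```

The temperature softens both distributions so the student also learns from the teacher's relative confidences across wrong classes, not just the argmax.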
Optimized for few-shot learning with configurable examples per class, demonstrated in the CIFAR benchmarks showing strong performance with minimal data.
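A few-shot subset is built by sampling a fixed number of examples per class. The helper below is a hypothetical, stdlib-only sketch of that sampling step (it is not the repository's own sampler):

```python
import random
from collections import defaultdict

def sample_few_shot(labels, examples_per_class, seed=0):
    """Pick `examples_per_class` example indices for every class,
    as in few-shot benchmark setups (hypothetical helper)."""
    rng = random.Random(seed)  # fixed seed for a reproducible subset
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)
    subset = []
    for y, idxs in sorted(by_class.items()):
        subset.extend(rng.sample(idxs, examples_per_class))
    return subset

labels = [0, 1, 0, 1, 2, 2, 0, 1, 2, 0]
subset = sample_few_shot(labels, examples_per_class=2)
print(subset)  # six indices, two per class
```

Fine-tuning then proceeds on this subset alone, which is where the large-scale pre-training pays off.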
Default hyper-parameters (BiT-HyperRule) are designed for Cloud TPUs and can be too resource-heavy for GPU setups, requiring manual tuning such as reducing the batch size and scaling the learning rate accordingly.
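The heuristics behind BiT-HyperRule can be sketched in a few lines: the schedule length grows with dataset size, the learning rate scales linearly with batch size relative to the TPU default of 512, and mixup is enabled only for larger datasets. The thresholds and values below follow my reading of the paper and may drift from the repository's current code, so treat this as an approximation:

```python
def bit_hyperrule(dataset_size, batch_size, base_lr=0.003):
    """Approximate sketch of the BiT-HyperRule heuristics (values
    assumed from the paper, not pulled from the repository's code)."""
    # Schedule length grows with dataset size.
    if dataset_size < 20_000:
        total_steps = 500
    elif dataset_size < 500_000:
        total_steps = 10_000
    else:
        total_steps = 20_000
    # Linear LR scaling: GPU users shrinking the batch from the TPU
    # default of 512 scale the learning rate down proportionally.
    lr = base_lr * batch_size / 512
    # Mixup regularization only for larger datasets.
    mixup = 0.1 if dataset_size >= 20_000 else 0.0
    return {"total_steps": total_steps, "lr": lr, "mixup": mixup}

# E.g. a GPU setup with batch 128 on a 50k-example dataset:
print(bit_hyperrule(dataset_size=50_000, batch_size=128))
```

The point of the rule is to remove per-dataset hyper-parameter search, but on GPUs the batch-size reduction (and the matching LR rescale) is the one knob you usually do have to turn by hand.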
Users must fine-tune models on custom datasets; there's no plug-and-play inference service, and integration relies on external data pipeline libraries.
Focused on image classification; adapting to other vision tasks like detection requires additional work, as models are tailored for VTAB-1k classification benchmarks.