Question 1

How do I use UCE to embed my single-cell RNA-seq data?

Accepted Answer

Run the eval_single_anndata.py script with parameters like adata_path, species, and model_loc, as described in the README. It processes AnnData files and adds embeddings to .obsm["X_uce"], making it easy to integrate into downstream analyses.

Question 2

UCE vs scVI: which is better for zero-shot cell embedding?

Accepted Answer

UCE is designed specifically for zero-shot embeddings across species without training, while scVI often requires dataset-specific training. UCE excels in cross-species comparisons, but scVI might offer more flexibility for single-species deep learning tasks.

Question 3

What GPU do I need for the UCE 33-layer model?

Accepted Answer

The README recommends an 80GB GPU for the 33-layer model with a batch size of 25. For lower-spec GPUs, you may need to reduce batch sizes or use the 4-layer variant, which is less resource-intensive.

Question 4

Can UCE handle datasets with non-standard gene annotations?

Accepted Answer

UCE requires gene names in .var_names, not ENSEMBL IDs, as specified in the README. If your dataset uses non-standard annotations, you may need to preprocess it to match the unified gene vocabulary, which could be a barrier.

Question 5

How to install UCE and its dependencies?

Accepted Answer

Install via pip with requirements.txt, which includes PyTorch and HuggingFace Accelerator. Ensure your environment supports GPU acceleration for optimal performance, as per the installation notes.

Question 6

Are UCE embeddings from different models comparable?

Accepted Answer

No, embeddings from the 4-layer and 33-layer models are not compatible, as stated in the data section. Stick to one model version for consistent results in analyses like clustering or visualization.

UCE

What is UCE?

Overview

Use Cases

Best For

Related Projects

Found a gem we're missing?

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions