diff --git a/README.md b/README.md index 979a38a..2c6d756 100644 --- a/README.md +++ b/README.md @@ -1,5 +1,7 @@ ## BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation + + This is the PyTorch implementation of the BLIP paper. The code has been tested on PyTorch 1.9 and 1.10. To install the dependencies, run
pip install -r requirements.txt
@@ -65,3 +67,12 @@ NLVR2 | python -m torch.distributed.run --nproc_per_node=16 --use_env train_nlvr.py +### Citation +If you find this code to be useful for your research, please consider citing. +
+@inproceedings{ALBEF,
+      title={Align before Fuse: Vision and Language Representation Learning with Momentum Distillation}, 
+      author={Junnan Li and Ramprasaath R. Selvaraju and Akhilesh Deepak Gotmare and Shafiq Joty and Caiming Xiong and Steven Hoi},
+      year={2021},
+      booktitle={NeurIPS},
+}