PIVOT - Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

Author

Seil Kang

Published

April 12, 2024

This post is a summary of my reading of PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs, posted to arXiv.2402 by S. Nasiriany et al. (Google DeepMind).

TO BE UPDATED

Grounding VLMs to Robot Actions through Image Annotations

3D to 2D Camera Calibration [link]

Prompting with Iterative Visual Optimization

Reuse

CC BY-NC-SA 4.0

Copyright

Copyright 2024. Seil Kang. All rights reserved. All content and materials on this website and articles are the property of Seil Kang. No part of this website and articles may be reproduced, distributed, transmitted, reused, or modified without prior written permission. Unauthorized use of this website and articles may violate copyright laws and international treaties.