I joined the Scene Understanding Group at Mercedes-Benz Research and Development as a Research Intern to work under the supervision of Felix Embacher and Dr-Ing. Jonas Uhrig. I worked in the analysis of the zero shot capabilities and failure points of vision-language models for image retrieval in the context Autonomous Driving. It was a great opportunity to work in a research settings where solutions directly tested on real world applications and to learn from the experts in the field.