#image-captioning

[ follow ]
fromHackernoon
1 month ago

How LightCap Sees and Speaks: Mobile Magic in Just 188ms Per Image | HackerNoon

In our experiments, we found that the LightCap model achieved efficient inference on mobile devices, processing images in about 188ms on the Kirin 990 CPU.
Artificial intelligence
#deep-learning
fromHackernoon
3 weeks ago

Comparing Chameleon AI to Leading Image-to-Text Models | HackerNoon

In evaluating Chameleon, we focus on tasks requiring text generation conditioned on images, particularly image captioning and visual question-answering, with results grouped by task specificity.
Artificial intelligence
[ Load more ]