Catalogue search • Linguistik portal • Fachinformationsdienst (FID)

1	#PraCegoVer: A Large Dataset for Image Captioning in Portuguese
	Gabriel Oliveira dos Santos; Esther Luna Colombini; Sandra Avila
	In: Data; Volume 7; Issue 2; Pages: 13 (2022)
	Abstract: Automatically describing images using natural sentences is essential to the inclusion of visually impaired people on the Internet. This problem is known as Image Captioning. There are many datasets in the literature, but most contain only English captions, whereas datasets with captions in other languages are scarce. We introduce #PraCegoVer, a multi-modal dataset with Portuguese captions based on posts from Instagram. It is the first large dataset for image captioning in Portuguese. In contrast to popular datasets, #PraCegoVer has only one reference per image, and both the mean and the variance of reference sentence length are significantly high, which makes our dataset linguistically challenging. We carry out a detailed analysis to find the main classes and topics in our data. We compare #PraCegoVer to the MS COCO dataset in terms of sentence length and word frequency. We hope that the #PraCegoVer dataset encourages more work on the automatic generation of descriptions in Portuguese.
	Keyword: #PraCegoVer; image captioning; image captioning in Portuguese; image-to-text
	URL: https://doi.org/10.3390/data7020013
	BASE
