Vietnamese Datasets
7 papers with code • 0 benchmarks • 0 datasets
Benchmarks
These leaderboards are used to track progress in Vietnamese Datasets
Most implemented papers
UIT-ViIC: A Dataset for the First Evaluation on Vietnamese Image Captioning
This paper contributes to research on Image Captioning task in terms of extending dataset to a different language - Vietnamese.
Vietnamese Word Segmentation with SVM: Ambiguity Reduction and Suffix Capture
In this paper, we approach Vietnamese word segmentation as a binary classification by using the Support Vector Machine classifier.
Conversational Machine Reading Comprehension for Vietnamese Healthcare Texts
To help machines understand conversation texts, we present UIT-ViCoQA, a new corpus for conversational machine reading comprehension in the Vietnamese language.
SA2SL: From Aspect-Based Sentiment Analysis to Social Listening System for Business Intelligence
In this paper, we present a process of building a social listening system based on aspect-based sentiment analysis in Vietnamese from creating a dataset to building a real application.
ViHealthBERT: Pre-trained Language Models for Vietnamese in Health Text Mining
We introduce ViHealthBERT, the first domain-specific pre-trained language model for Vietnamese healthcare.
New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for Vietnamese Multimodal Aspect-Category Sentiment Analysis
To address this, we introduce a new Vietnamese multimodal dataset, named ViMACSA, which consists of 4, 876 text-image pairs with 14, 618 fine-grained annotations for both text and image in the hotel domain.