ACL Anthology
News
(current)
FAQ
(current)
Corrections
(current)
Submissions
(current)
GitHub
This page is part of a
temporary preview
of a proposed change that may be incomplete or contain mistakes. It is
not official
and will be removed when the change is merged or abandoned.
Durashi
Langappuli
2020
pdf
bib
Dialog policy optimization for low resource setting using Self-play and Reward based Sampling
Tharindu Madusanka
|
Durashi Langappuli
|
Thisara Welmilla
|
Uthayasanker Thayasivam
|
Sanath Jayasena
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation
Search
Co-authors
Sanath Jayasena
1
Tharindu Madusanka
1
Uthayasanker Thayasivam
1
Thisara Welmilla
1
Venues
paclic
1
Fix author