This study examines pre-trained language models for text-to-code generation. Because an input sequence mixes two modalities, natural-language text and programming-language code, the researchers explore how tokens can be adapted and represented differently depending on the modality they belong to. In particular, they experiment with separating the embedding spaces of the two modalities during further model pre-training, and report consistent improvements in text-to-code generation across two models and two test sets. A hedged sketch of the embedding-separation idea follows below.
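The sketch below is a minimal, illustrative take on what modality-separated embeddings could look like in a PyTorch-style setup: one embedding table is used for natural-language tokens and another for code tokens, selected per token by a modality mask. The class name `DualModalityEmbedding`, the `code_mask` argument, and the toy token ids are assumptions for illustration, not the authors' actual implementation (see the project page for that).

```python
import torch
import torch.nn as nn


class DualModalityEmbedding(nn.Module):
    """Illustrative layer keeping separate embedding tables for
    natural-language tokens and code tokens (hypothetical names)."""

    def __init__(self, vocab_size: int, hidden_size: int):
        super().__init__()
        # One table per modality; both cover the full shared vocabulary.
        self.text_embed = nn.Embedding(vocab_size, hidden_size)
        self.code_embed = nn.Embedding(vocab_size, hidden_size)

    def forward(self, input_ids: torch.Tensor, code_mask: torch.Tensor) -> torch.Tensor:
        # code_mask: 1 where a token belongs to the code segment, 0 for text.
        text_vectors = self.text_embed(input_ids)
        code_vectors = self.code_embed(input_ids)
        mask = code_mask.unsqueeze(-1).to(text_vectors.dtype)
        # Pick the code-space vector for code tokens, text-space otherwise.
        return mask * code_vectors + (1.0 - mask) * text_vectors


# Toy usage: the first tokens play the role of the natural-language prompt,
# the rest the generated code (ids and mask are made up for illustration).
embed = DualModalityEmbedding(vocab_size=50_000, hidden_size=768)
input_ids = torch.tensor([[101, 2023, 2003, 7592, 1027, 1000]])
code_mask = torch.tensor([[0, 0, 0, 1, 1, 1]])
hidden = embed(input_ids, code_mask)
print(hidden.shape)  # torch.Size([1, 6, 768])
```

The design choice this illustrates is that identical surface tokens (e.g. `print` in prose versus in code) can receive different representations depending on which modality they occur in, which is the intuition behind separating the embedding spaces during continued pre-training.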
Publication date: 8 Feb 2024
Project Page: https://github.com/huawei-noah/noah-research/tree/master/NLP/text2code_mrpt
Paper: https://arxiv.org/pdf/2402.05783