Latest pr news

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation. (arXiv:2005.03361v1 [cs.CL])

By arxiv.org

Neural machine translation (NMT) needs large parallel corpora for state-of-the-art translation quality. Low-resource NMT is typically addressed by transfer learning, which leverages large monolingual or parallel corpora for pre-training. Monolingual pre-training approaches such as MASS (MAsked Sequence to Sequence) are extremely effective in boosting NMT quality for languages with small parallel corpora. However, they do not account for linguistic information obtained using syntactic analyzers, which is known to be invaluable for several Natural Language Processing (NLP) tasks. To this end, we propose JASS, Japanese-specific Sequence to Sequence, as a novel pre-training alternative to MASS for NMT involving Japanese as the source or target language. JASS is joint BMASS (Bunsetsu MASS) and BRSS (Bunsetsu Reordering Sequence to Sequence) pre-training, which focuses on Japanese linguistic units called bunsetsus. In our experiments on ASPEC Japanese--English and News Commentary Japanese--Russian translation, we show that JASS can give results that are competitive with, if not better than, those given by MASS. Furthermore, we show for the first time that joint MASS and JASS pre-training gives results that significantly surpass the individual methods, indicating their complementary nature. We will release our code, pre-trained models and bunsetsu-annotated data as resources for researchers to use in their own NLP tasks.

Estimating Blood Pressure from Photoplethysmogram Signal and Demographic Features using Machine Learning Techniques. (arXiv:2005.03357v1 [eess.SP])

DMCP: Differentiable Markov Channel Pruning for Neural Networks. (arXiv:2005.03354v1 [cs.CV])

Pricing under a multinomial logit model with non linear network effects. (arXiv:2005.03352v1 [cs.GT])

Nakdan: Professional Hebrew Diacritizer. (arXiv:2005.03312v1 [cs.CL])

Expressing Accountability Patterns using Structural Causal Models. (arXiv:2005.03294v1 [cs.SE])

Continuous maximal covering location problems with interconnected facilities. (arXiv:2005.03274v1 [math.OC])

Online Proximal-ADMM For Time-varying Constrained Convex Optimization. (arXiv:2005.03267v1 [eess.SY])

Critique of Boyu Sima's Proof that ${\rm P}\neq{\rm NP}$. (arXiv:2005.03256v1 [cs.CC])

DFSeer: A Visual Analytics Approach to Facilitate Model Selection for Demand Forecasting. (arXiv:2005.03244v1 [cs.HC])

Enhancing Software Development Process Using Automated Adaptation of Object Ensembles. (arXiv:2005.03241v1 [cs.SE])

Hierarchical Predictive Coding Models in a Deep-Learning Framework. (arXiv:2005.03230v1 [cs.CV])

Diagnosis of Coronavirus Disease 2019 (COVID-19) with Structured Latent Multi-View Representation Learning. (arXiv:2005.03227v1 [eess.IV])

Multi-dimensional Avikainen's estimates. (arXiv:2005.03219v1 [math.PR])

A Stochastic Geometry Approach to Doppler Characterization in a LEO Satellite Network. (arXiv:2005.03205v1 [cs.IT])

What comprises a good talking-head video generation?: A Survey and Benchmark. (arXiv:2005.03201v1 [cs.CV])

Enabling Cross-chain Transactions: A Decentralized Cryptocurrency Exchange Protocol. (arXiv:2005.03199v1 [cs.CR])

Distributed Stabilization by Probability Control for Deterministic-Stochastic Large Scale Systems: Dissipativity Approach. (arXiv:2005.03193v1 [eess.SY])

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context. (arXiv:2005.03191v1 [eess.AS])

An Optimal Control Theory for the Traveling Salesman Problem and Its Variants. (arXiv:2005.03186v1 [math.OC])

Determinantal Point Processes in Randomized Numerical Linear Algebra. (arXiv:2005.03185v1 [cs.DS])

A Proposal for Intelligent Agents with Episodic Memory. (arXiv:2005.03182v1 [cs.AI])

On Optimal Control of Discounted Cost Infinite-Horizon Markov Decision Processes Under Local State Information Structures. (arXiv:2005.03169v1 [eess.SY])

An augmented Lagrangian preconditioner for implicitly-constituted non-Newtonian incompressible flow. (arXiv:2005.03150v1 [math.NA])

A Gentle Introduction to Quantum Computing Algorithms with Applications to Universal Prediction. (arXiv:2005.03137v1 [quant-ph])

Evaluation, Tuning and Interpretation of Neural Networks for Meteorological Applications. (arXiv:2005.03126v1 [physics.ao-ph])

Electricity-Aware Heat Unit Commitment: A Bid-Validity Approach. (arXiv:2005.03120v1 [eess.SY])

Strong replica symmetry in high-dimensional optimal Bayesian inference. (arXiv:2005.03115v1 [math.PR])

Constrained de Bruijn Codes: Properties, Enumeration, Constructions, and Applications. (arXiv:2005.03102v1 [cs.IT])

Inference with Choice Functions Made Practical. (arXiv:2005.03098v1 [cs.AI])

Eliminating NB-IoT Interference to LTE System: a Sparse Machine Learning Based Approach. (arXiv:2005.03092v1 [cs.IT])

Experiences from Exporting Major Proof Assistant Libraries. (arXiv:2005.03089v1 [cs.SE])

CovidCTNet: An Open-Source Deep Learning Approach to Identify Covid-19 Using CT Image. (arXiv:2005.03059v1 [eess.IV])

Extracting Headless MWEs from Dependency Parse Trees: Parsing, Tagging, and Joint Modeling Approaches. (arXiv:2005.03035v1 [cs.CL])

Fault Tree Analysis: Identifying Maximum Probability Minimal Cut Sets with MaxSAT. (arXiv:2005.03003v1 [cs.AI])

Football High: Helmets Do Not Prevent Concussions

Retired Soccer Star Briana Scurry on Girls Soccer and Concussion Protocols

Retired Soccer Star Briana Scurry on Her Post-Concussion Depression

5 Best Practices for Breadcrumb Navigation

How Personalized Landing Pages Can Make Your Site More Profitable

Is My WordPress Site Secure? 13 Tips for Locking Down Your WordPress Site

5 Lead Generation Website Design Best Practices

Is My WordPress Site ADA Compliant? 3+ Plugins for Finding Out!

What is Website Conversion? [+5 Ways to Improve Conversions]

Website Redesign Process: Your Website Redesign Strategy in 5 Steps

Printed Solar Cells Hold Promise for Unlit Rural Areas

A Different Approach to Coding With React Hooks

(Probably) No NaNoWriMo This Year

Writing a WordPress book. Again.

4K UHD Collection: April 2020

The Finish Line: Right Solutions for the Right Problems

Building Product Transparency— Be Careful What You Ask For

Companies' 'Green' Efforts Include Products’ Material Content

HID Global Sustainability Practices

Panasonic's Security Solutions Start With Energy-Efficient Products

FHWA rule updates protections for workers and drivers in work zones

Ultrabond ECO 885 Premium Grade Polyolefin Backed Carpet Adhesive

Cooperativa Ceramica d'Imola North America Debuts 5 New Programs

Crossville’s Wood Impressions Collection

Propex to Showcase Isis Modular Tile Backing at FloorTek Expo

The Carpet and Rug Institute Presents the 2024 Joseph J. Smrekar Memorial Award

Top 2024 Advances in Alternative Protein

As Traffic Crash Fatalities Rise, Portland Auditor’s Office Recommends Changes to Vision Zero Program

'Apprehensive and fearful': Federal workers await a dismantling under Trump

Blue states prepare to fight Trump administration policies
