Deciphering undersegmented ancient scripts using phonetic prior
Most undeciphered lost languages exhibit two characteristics that pose significant
decipherment challenges:(1) the scripts are not fully segmented into words;(2) the closest …
decipherment challenges:(1) the scripts are not fully segmented into words;(2) the closest …
Neural decipherment via minimum-cost flow: From Ugaritic to Linear B
J Luo, Y Cao, R Barzilay - arXiv preprint arXiv:1906.06718, 2019 - arxiv.org
In this paper we propose a novel neural approach for automatic decipherment of lost
languages. To compensate for the lack of strong supervision signal, our model design is …
languages. To compensate for the lack of strong supervision signal, our model design is …
Lexinvariant language models
Token embeddings, a mapping from discrete lexical symbols to continuous vectors, are at
the heart of any language model (LM). However, lexical symbol meanings can also be …
the heart of any language model (LM). However, lexical symbol meanings can also be …
Decoding anagrammed texts written in an unknown language and script
Algorithmic decipherment is a prime example of a truly unsupervised problem. The first step
in the decipherment process is the identification of the encrypted language. We propose …
in the decipherment process is the identification of the encrypted language. We propose …
[PDF][PDF] The Use of Project Gutenberg and Hexagram Statistics to Help Solve Famous Unsolved Ciphers.
R Bean - HistoCrypt, 2020 - apprendre-en-ligne.net
Abstract Project Gutenberg, begun by Michael Hart in 1971, is an attempt to make public
domain electronic texts available to the public in an easily available and useable form. The …
domain electronic texts available to the public in an easily available and useable form. The …
Can Sequence-to-Sequence Models Crack Substitution Ciphers?
N Aldarrab, J May - arXiv preprint arXiv:2012.15229, 2020 - arxiv.org
Decipherment of historical ciphers is a challenging problem. The language of the target
plaintext might be unknown, and ciphertext can have a lot of noise. State-of-the-art …
plaintext might be unknown, and ciphertext can have a lot of noise. State-of-the-art …
Monte-Carlo tree search for artificial general intelligence in games
CF Sironi - 2019 - cris.maastrichtuniversity.nl
Abstract Research in Artificial Intelligence has shown that machines can be programmed to
perform as well as, or even better than humans in specific tasks, such as playing Chess …
perform as well as, or even better than humans in specific tasks, such as playing Chess …
Decipherment of substitution ciphers with neural language models
N Kambhatla - 2018 - summit.sfu.ca
The decipherment of homophonic substitution ciphers using language models (LMs) is a
well-studied task in Natural Language Processing (NLP). Previous work in this topic score …
well-studied task in Natural Language Processing (NLP). Previous work in this topic score …
Decipherment as regression: Solving historical substitution ciphers by learning symbol recurrence relations
Solving substitution ciphers involves mapping sequences of cipher symbols to fluent text in a
target language. This has conventionally been formulated as a search problem, to find the …
target language. This has conventionally been formulated as a search problem, to find the …
Nested rollout policy adaptation with selective policies
T Cazenave - Computer Games: 5th Workshop on Computer Games …, 2017 - Springer
Abstract Monte Carlo Tree Search (MCTS) is a general search algorithm that has improved
the state of the art for multiple games and optimization problems. Nested Rollout Policy …
the state of the art for multiple games and optimization problems. Nested Rollout Policy …