Mathbert: A pre-trained model for mathematical formula understanding
Large-scale pre-trained models like BERT, have obtained a great success in various Natural
Language Processing (NLP) tasks, while it is still a challenge to adapt them to the math …
Language Processing (NLP) tasks, while it is still a challenge to adapt them to the math …
Tangent-CFT: An embedding model for mathematical formulas
When searching for mathematical content, accurate measures of formula similarity can help
with tasks such as document ranking, query recommendation, and result set clustering …
with tasks such as document ranking, query recommendation, and result set clustering …
Introduction to mathematical language processing: Informal proofs, word problems, and supporting tasks
Automating discovery in mathematics and science will require sophisticated methods of
information extraction and abstract reasoning, including models that can convincingly …
information extraction and abstract reasoning, including models that can convincingly …
Mathematical Information Retrieval: A Review
Mathematical formulas are commonly used to demonstrate theories and basic fundamentals
in the Science, Technology, Engineering, and Mathematics (STEM) domain. The burgeoning …
in the Science, Technology, Engineering, and Mathematics (STEM) domain. The burgeoning …
[PDF][PDF] NTCIR-12 MathIR Task Overview.
We present an overview of the NTCIR-12 MathIR Task, dedicated to information access for
mathematical content. The MathIR task makes use of two corpora. The first corpus contains …
mathematical content. The MathIR task makes use of two corpora. The first corpus contains …
Evaluating token-level and passage-level dense retrieval models for math information retrieval
With the recent success of dense retrieval methods based on bi-encoders, studies have
applied this approach to various interesting downstream retrieval tasks with good efficiency …
applied this approach to various interesting downstream retrieval tasks with good efficiency …
Layout and semantics: Combining representations for mathematical formula search
Math-aware search engines need to support formulae in queries. Mathematical expressions
are typically represented as trees defining their operational semantics or visual layout. We …
are typically represented as trees defining their operational semantics or visual layout. We …
Accelerating substructure similarity search for formula retrieval
Formula retrieval systems using substructure matching are effective, but suffer from slow
retrieval times caused by the complexity of structure matching. We present a specialized …
retrieval times caused by the complexity of structure matching. We present a specialized …
One blade for one purpose: advancing math information retrieval using hybrid search
Neural retrievers have been shown to be effective for math-aware search. Their ability to
cope with math symbol mismatches, to represent highly contextualized semantics, and to …
cope with math symbol mismatches, to represent highly contextualized semantics, and to …
A survey in mathematical language processing
Informal mathematical text underpins real-world quantitative reasoning and communication.
Developing sophisticated methods of retrieval and abstraction from this dual modality is …
Developing sophisticated methods of retrieval and abstraction from this dual modality is …