Deyan Ginev · @dginev
307 followers · 415 posts · Server mathstodon.xyz

It is neat to see MetaAI using LaTeXML productively for arXiv preprocessing on their Nougat OCR work.

Good discussion in "5.2 Text modalities": there is indeed a lot of hidden complexity when recovering TeX input strings.

Rather tempting to wish for a way to normalize to "canonical" expressions...

project homepage: facebookresearch.github.io/nou

arXiv preprint:
arxiv.org/abs/2308.13418

#arxiv #latexml

Last updated 1 year ago

Deyan Ginev · @dginev
301 followers · 387 posts · Server mathstodon.xyz

This lovely open access book is also distributed as MathML-native HTML and (to my surprise!) with some help from LaTeXML.

"Artificial Intelligence 3E: foundations of computational agents",
by David L. Poole & Alan K. Mackworth

artint.info/3e/html/ArtInt3e.h

#openaccess #latexml #mathml

Last updated 1 year ago

Deyan Ginev · @dginev
299 followers · 383 posts · Server mathstodon.xyz

Richard Zach announces today that their book:

"forall x: Calgary. An Introduction to Formal Logic"

is now available as accessible HTML, via #bookml and #latexml

Read more at
openlogicproject.org/2023/07/2

#bookml #latexml

Last updated 1 year ago

Deyan Ginev · @dginev
296 followers · 354 posts · Server mathstodon.xyz

A common story:

"Nothing really worked perfectly – looks great, but is still a little complicated.
So, I [...] recreated the table as a machine-readable YAML file which is transformed to TeX and HTML by using respective templates with Jinja."

x-dev.pages.jsc.fz-juelich.de/

#latexml

Last updated 1 year ago

Leonard/Janis Robert König · @ljrk
349 followers · 12956 posts · Server todon.eu

Currently experimenting with adapting the @pandoc output from #markdown to #html to work with the awesome #css built for #arxiv by @dginev. I could also use #pandoc to convert to #latex and then use #latexml through their github.com/dginev/ar5ivist tool for #html, but why the round-trip? The changes are still incomplete: not all font things are adjusted, and right now all footnotes are represented by the same symbol. Still, I'm quite happy with the intermediate results :-)

This gives me reactive footnotes, either in the margins (almost #tufte style) or through hovering, nicer link highlighting, and quite acceptably justified text (I'm surprised how far the web has come). I haven't tweaked the fonts further yet, and I want to keep my code indented.

Left: Run through pandoc with custom HTML template & some filters

Right: Current state as seen on ljrk.codeberg.page/unixv6-allo produced with a minimalist CSS stylesheet I stole from somewhere.

#markdown #html #css #arxiv #pandoc #latex #latexml #tufte #lua

Last updated 1 year ago

Deyan Ginev · @dginev
224 followers · 163 posts · Server mathstodon.xyz

@rowan @jonny

My honest response is, as usual for this theme: "here we go again" 😀
There are so many unfinished LaTeX implementations - but some of them are indeed useful.

I have a "fake" benchmark file that I use to compare various tools, and here's the first encounter with it. It isn't particularly important - one of the many tests.

On my machine you stack up quite well - only tralics, pandoc, hevea and the Rust rewrite of LaTeXML are faster:
gist.github.com/dginev/35236be

#latexml

Last updated 2 years ago

Deyan Ginev · @dginev
177 followers · 102 posts · Server mathstodon.xyz

LaTeXML 0.8.7 was just released!

We're ready for MathML Core, which is expected to be available in all major browsers early in 2023.

With gratitude to the wider academic community, who helped drive another productive year of extending our TeX interpretation fidelity and our LaTeX ecosystem coverage.

Full release notes at:
github.com/brucemiller/LaTeXML

#mathml #ar5iv #latexml

Last updated 2 years ago

Deyan Ginev · @dginev
105 followers · 42 posts · Server mathstodon.xyz

Hi everyone, Deyan here!

I tend to discuss the journey of converting #arxiv into #ar5iv: an HTML5 preview site for the world's largest preprint server.

I'm helping to develop the next generation of #latexml and #mathml, focusing on the most idiosyncratic corners of #latex and math syntax.

And you'll see the occasional AI art / Large language model experiment flying by as well...

#latex #mathml #latexml #ar5iv #arxiv #introduction

Last updated 2 years ago