11 comments

  • raudette7 hours ago
    Related: for fun over the holidays, I created an ePub of a paperback copy of &quot;I Brought The Ages Home&quot;, by Charles T. Currelly, which went out of copyright in Canada in 2007 (copyright in Canada changed from 50 to 70 years after the death of the author in 2022, but this did not affect works that were already in the public domain).<p>I couldn&#x27;t find a ebook online, so I found an old paperback copy and created one: <a href="https:&#x2F;&#x2F;www.hotelexistence.ca&#x2F;create-epub-from-paperback&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.hotelexistence.ca&#x2F;create-epub-from-paperback&#x2F;</a><p>Charles T. Currelly was like a real-life Indiana Jones, he was the first director of the Royal Ontario Museum in Toronto, and sourced much of its early collections.<p>Even with modern OCR (I used Mistral&#x27;s here), and a book with limited formatting, it&#x27;s funny how hours of touch-ups are required just to get a glitch-free reading experience (no stray headers, paragraphs, page numbers sprinkled through the text).
  • robin_reala12 hours ago
    I’m a contributor – I did Kafka’s <i>The Castle</i>, Agatha Christie’s <i>Giant’s Bread</i>, and Stella Benson’s <i>The Faraway Bride</i> for this launch – and I’m happy to answer any questions about Standard Ebooks.
    • asveikau49 minutes ago
      What kind of features does &quot;advanced epub&quot; provide that &quot;compatible epub&quot; cannot?
      • robin_reala23 minutes ago
        Our projects are built with the broad assumption that the latest tech is supported across the board: the “advanced epub” is generated by zipping that up and renaming it to .epub. The build step then runs the book through a compatibility pass to remove some of the functionality that simply isn’t there in many readers. For example, it:<p>1. renders the cover as a JPG from the native SVG<p>2. converts more complicated selectors (e.g. :nth-of-type, :empty, etc.) to classes and class selectors<p>3. renders MathML as images<p>4. renders SVGs as images<p>…and a bunch more things to make it work better with older ereaders, which tend hang around for a long time.<p>It’s worth pointing out that the Kobo builds are closer to the “advanced” build as they have great native support for MathML and SVG. But the build step adds in a bunch of Kobo-specific markup to make it work better there.<p>In my experience, Apple Books does the best with the advanced builds, but it doesn’t support SVG covers so you end up with autogenerated ones, which seems a shame. I’ve sent a couple of feedback issues to them over the years, but if anyone else wants to also do this feel free.
    • me_jumper7 hours ago
      I could not find out if there are outstanding todos that I could assign myself to as a newcomer. I&#x27;d like to contribute, but don&#x27;t know where to start. Is there an issue board somewhere with missing books, or something else?
      • acabal5 hours ago
        We have a list of wanted ebooks here: <a href="https:&#x2F;&#x2F;standardebooks.org&#x2F;contribute&#x2F;wanted-ebooks" rel="nofollow">https:&#x2F;&#x2F;standardebooks.org&#x2F;contribute&#x2F;wanted-ebooks</a><p>First-time contributors should select something from the appropriate section, because that gives you the greatest chance of succeeding and the least burden on our reviewers as you get started.<p>Our toolset has a help wanted section and some outstanding issues: <a href="https:&#x2F;&#x2F;github.com&#x2F;standardebooks&#x2F;tools#help-wanted" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;standardebooks&#x2F;tools#help-wanted</a>
      • robin_reala6 hours ago
        Yep! Have a look at our contribution page that lists a bunch of different options, including a list of good first books. <a href="https:&#x2F;&#x2F;standardebooks.org&#x2F;contribute" rel="nofollow">https:&#x2F;&#x2F;standardebooks.org&#x2F;contribute</a>
    • as1mov11 hours ago
      Probably a dumb question, but how do you guys decide (and source) the book covers? I love how they look, but as a philistine can&#x27;t put into words why.<p>Also thanks for doing this, I&#x27;ve read a bunch of stuff (GK Chesterton, Dorothy Sayers, Dashiell Hammett) that I wouldn&#x27;t have otherwise if it weren&#x27;t for this service.
      • robin_reala11 hours ago
        The historical criteria is fine-art style oil painting. These days we’re starting to use first-edition cover art a bunch for more modern productions if it’s good quality. We also tend to use abstract oil paintings for sci-fi.[1] Obviously, all art is sourced from the public domain too. We’ve also started a database of confirmed-US-PD artwork that we can use for future productions.[2]<p>[1] <a href="https:&#x2F;&#x2F;standardebooks.org&#x2F;subjects&#x2F;science-fiction" rel="nofollow">https:&#x2F;&#x2F;standardebooks.org&#x2F;subjects&#x2F;science-fiction</a><p>[2] <a href="https:&#x2F;&#x2F;standardebooks.org&#x2F;artworks" rel="nofollow">https:&#x2F;&#x2F;standardebooks.org&#x2F;artworks</a>
      • acabal5 hours ago
        As Robin mentioned the typical style is &quot;fine art oil painting&quot;, with some wiggle room allowed for exceptionally difficult cases (like Asian-themed books, as there just wasn&#x27;t much fine art on that subject pre-1930).<p>We also require that the art have some kind of connection to the book itself, so it&#x27;s not just some random fine art. Sometimes the connection is a little fuzzy, but we do the best we can given that art must be pre-1930 and also must have been previously published.<p>(My personal favorite artwork selection of the books I worked on is The Communist Manifesto[1]. That painting was actually made specifically for a different book by Willa Cather[2], but I thought the peasant laborer, holding a sickle in one hand, with a faraway look in her eyes as the red sun rises behind her was just <i>too good to pass up</i> for Marx!)<p>1920ish was when it started becoming much more common for books to have illustrated dust jackets, so now that more books from that era and onwards are entering the public domain, we opt to use the first edition dust jacket if it&#x27;s in the appropriate style. Fortunately for us, that era also happens to be the so-called Golden Age of Illustration so it&#x27;s not hard finding beautiful art to use!<p>[1] <a href="https:&#x2F;&#x2F;standardebooks.org&#x2F;ebooks&#x2F;karl-marx_friedrich-engels&#x2F;the-communist-manifesto&#x2F;samuel-moore" rel="nofollow">https:&#x2F;&#x2F;standardebooks.org&#x2F;ebooks&#x2F;karl-marx_friedrich-engels...</a><p>[2] <a href="https:&#x2F;&#x2F;standardebooks.org&#x2F;ebooks&#x2F;willa-cather&#x2F;the-song-of-the-lark" rel="nofollow">https:&#x2F;&#x2F;standardebooks.org&#x2F;ebooks&#x2F;willa-cather&#x2F;the-song-of-t...</a>
    • pastage11 hours ago
      Do you think the things that makes an edition special goes missing while converting to e.g. Standard Ebooks. I remember both the The Castle and Das Schloss like they had typesetting that helped me in perceiving the feel of the book. Is there anyway to preserve that feeling and still keep within the bounds of standardisation you adhere to? (I did a quick look through my copy and it does not seem to be much that makes it unique really, just the size of the book, and the chapter heading graphics..)<p>Do you know if the project try to look at other languages at all?
      • robin_reala10 hours ago
        Nothing particularly in <i>The Castle</i>, from my production of it. As this was not previously PD there wasn’t any Gutenberg (or other) transcription available, so I did my own from the OCR of the original scans. A large part of the feel of the work, to me at least, comes from the extreme sentence &#x2F; paragraph lengths though.<p>We do have a default typography across all our works (the “Standard” in “Standard Ebooks” refers to a standard imprint; think Penguin) but we usually retain specific famous things where possible in a reflowable format. For example, the Mouse’s Tail in <i>Alice’s Adventures in Wonderland</i>,[1] or the letter in E. A. Poe’s “Thou Art the Man”.[2]<p>We don’t take on other languages, no. Our tooling[3] and style guides[4] are tailored specifically to English. Absolutely nothing stopping another project from forking the codebase (it’s GPL-3) and giving it a go.<p>[1] <a href="https:&#x2F;&#x2F;standardebooks.org&#x2F;ebooks&#x2F;lewis-carroll&#x2F;alices-adventures-in-wonderland&#x2F;john-tenniel&#x2F;text&#x2F;chapter-3#:~:text=the%20tale%20was%20something%20like%20this" rel="nofollow">https:&#x2F;&#x2F;standardebooks.org&#x2F;ebooks&#x2F;lewis-carroll&#x2F;alices-adven...</a><p>[2] <a href="https:&#x2F;&#x2F;standardebooks.org&#x2F;ebooks&#x2F;edgar-allan-poe&#x2F;short-fiction&#x2F;text&#x2F;thou-art-the-man#:~:text=the%20following%20letter" rel="nofollow">https:&#x2F;&#x2F;standardebooks.org&#x2F;ebooks&#x2F;edgar-allan-poe&#x2F;short-fict...</a><p>[3] <a href="https:&#x2F;&#x2F;github.com&#x2F;standardebooks&#x2F;tools" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;standardebooks&#x2F;tools</a><p>[4] <a href="https:&#x2F;&#x2F;standardebooks.org&#x2F;manual&#x2F;" rel="nofollow">https:&#x2F;&#x2F;standardebooks.org&#x2F;manual&#x2F;</a>
      • hopelite10 hours ago
        It seems you may be making assumptions that the formatting and typesetting of any particular edition were intentional or even deliberate on the part of the author, not any number of people, from editors to printers, who could and would have influenced those things for various reasons.<p>Something I am rather familiar with is brought out by your mention of the German edition&#x2F;title; that the continental market seems to generally produce books that are far more densely formatted, i.e., smaller font and typesetting, thinner pages, and leading to overall tighter book formats. I actually appreciate it when, e.g., a book is 1&#x2F;2 the size and weight, and usually also made far more durably; but it will invariably compromise any author intention related to the arrangement of the lettering.<p>Maybe you can confirm that based on what seems to be your English and German editions of the same novel.
        • pastage9 hours ago
          Well that depends, there are obviously authors that care about these things. I have no idea what Brods intentions were with the book, and if he cared about layout.<p>The German and Swedish editions I read were similarly typeset, and the first scan I found in English felt similar. What I wanted to know was if there was some thought into it, because the website is nicely designed so striving for a unique typesetting strategy could be a goal.
        • scooke7 hours ago
          The formatting they are referring to is not that of the original text but that of the Standard Ebooks project.
        • woliveirajr8 hours ago
          &gt; tighter<p>I found it amusing, considering all those memes about German words with 35 letters each.<p>And, as I get older, I began to consider letter size relevant to choose a book edition. Gave up buying new books and went for used, older editions with bigger letters.
      • StopDisinfo9107 hours ago
        I seem to remember that they had some very opinionated rules at the beginning regarding allowed spelling and typography. Some of them felt distinctly American to me. I don&#x27;t know if that&#x27;s still the case.<p>Apart from that, they produce nice editions.
        • robin_reala6 hours ago
          It’s en-US typography (flavoured by the Chicago Manual of Style). Spelling is based on the original book, though some modernisations are made. Commits with these always start with [Editorial] for easy later reference, and are typically things like to-day -&gt; today.
    • pastor_williams2 hours ago
      Why don&#x27;t the ebooks include any of the original illustrations? This makes the children&#x27;s books in particular very unattractive.
      • robin_reala1 hour ago
        We do include illustrations, but in fairly defined categories: when they’re factually illustrative, when they’re graphs or charts, when they’re part of the plot (an illustration of a clue in a murder mystery, for example), or when they’re referenced from the text.<p>The last point is why the Beatrix Potter compilation I did ended up with illustrations: the text specifically references the illustrations in a couple of the books (“Can you see…?”) so they remained. It did mean writing 602 pieces of alt text though[1] so it was a fairly major undertaking to include them.<p>[1] <a href="https:&#x2F;&#x2F;standardebooks.org&#x2F;ebooks&#x2F;beatrix-potter&#x2F;short-fiction&#x2F;text&#x2F;loi" rel="nofollow">https:&#x2F;&#x2F;standardebooks.org&#x2F;ebooks&#x2F;beatrix-potter&#x2F;short-ficti...</a>
    • jez5 hours ago
      What’s the approach to embedding fonts in standard ebooks’ epubs? Curious whether there’s a set of fonts that producers are allowed to embed in finished books, or whether there’s project consciously avoids embedding fonts, and if so why.<p>I tried to find a policy page on this via a standardebooks.org site search but nothing looked relevant.<p>I’m asking after realizing that some of my favorite books were books where the ebook had intentional font choices, for example different fonts for chapter titles vs body text, fonts that matched the vibe of the book (historical, more modern, etc.) It would be nice if more ebook readers made it easy to import more than the ~8 fonts they include by default but the next best thing is when the book itself includes a great font.
      • acabal5 hours ago
        The ebooks we produce are entirely in the US public domain, including metadata and any other files. Unfortunately there are basically no good fonts released under the CC0 license. (Most open fonts are released under the OFL license, which is not the same.) Therefore we don&#x27;t embed any font files, except for Standard Blackletter[1] when necessary, which is a font we developed especially for our use based on public domain specimens, and released via the CC0 license.<p>[1] <a href="https:&#x2F;&#x2F;github.com&#x2F;standardebooks&#x2F;standard-blackletter" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;standardebooks&#x2F;standard-blackletter</a>
    • Kerrick6 hours ago
      How did you get these ready for release on Public Domain Day without breaking copyright law during the production process? I am not a lawyer, so I have only a surface understanding.
      • robin_reala6 hours ago
        I’m not based in the US and have no intention of travelling there, so I don’t count myself in much danger for US copyright law.<p>Regardless, my understanding of copyright is that people broadly get annoyed when you infringe by distribution. In this case the distribution didn’t happen until the copyright had expired. People preparing these projects for later launch do it purely on their own machines, and nothing arrives at Standard Ebooks until the day of release.
    • fxbois11 hours ago
      Do you plan to provide PDF ?
      • dajare11 hours ago
        SE makes ebooks available in four formats: &quot;compatible&quot; epub; &quot;advanced&quot; epub; kobo-compatible (&quot;kepub&quot;); and kindle (&quot;azw3&quot;). No PDFs.<p>One of the SE editors experimented with turning SE ebooks into PDFs, though. See more about that here: <a href="https:&#x2F;&#x2F;groups.google.com&#x2F;g&#x2F;standardebooks&#x2F;c&#x2F;Xy2bwiexLeM&#x2F;m&#x2F;fgk5UmyVAAAJ" rel="nofollow">https:&#x2F;&#x2F;groups.google.com&#x2F;g&#x2F;standardebooks&#x2F;c&#x2F;Xy2bwiexLeM&#x2F;m&#x2F;f...</a>
      • cess1111 hours ago
        I don&#x27;t know their reasons but PDF is a rather problematic format so I suspect that&#x27;s why.<p>You can run their EPUB through Pandoc to convert yourself, or put some effort in and setup your own Calibre instance which will do something similar when you ask it to.
    • NoMoreNicksLeft3 hours ago
      &gt;and I’m happy to answer any questions about Standard Ebooks.<p>When will they start assigning catalog numbers to each of their works in the way that Project Guteneberg does? I&#x27;d like a unique id per ebook and since you don&#x27;t (for obvious reasons) use ISBNs, there&#x27;s nothing really to be done. I can&#x27;t use an OCLC id because Standard Ebooks aren&#x27;t consistently listed in Worldcat, I can&#x27;t use Bookbrainz or Open Library ids for the same reason.<p>It costs nothing and is a low-effort fix. It&#x27;s been an industry (and library science) thing for decades or longer. Can Standard Ebooks finally stop the amateur hour crap?
      • acabal3 hours ago
        I know you griped about this in a different thread, but we won&#x27;t be doing that, sorry. You can uniquely identify an ebook and its version by using dc:identifier in combination with dcterms:modified in the metadata file. If you desperately need a filesystem-safe string then concatenate those two and sha it.
        • NoMoreNicksLeft43 minutes ago
          Sha&#x27;s aren&#x27;t human-readable, aren&#x27;t concise. It&#x27;s a shit way of doing it.<p>Not that it matters, most of your catalog is done better by others, I&#x27;ll just go with them. Why would I want books from people who have an aversion for doing things right?
  • idoubtit9 hours ago
    The title makes it look like Public Domain is universal, while the article does mention that this list is only about the USA.<p>&gt; On January 1, 2026, books published in 1930 enter the U.S. public domain.<p>The Copyright laws are different in each country, and it&#x27;s a non-sense in the modern world.<p>A few years ago, I was searching for books written by Alexandra David-Neel. I found them on a Canadian (IIRC) website, but downloads were filtered by geo-IP, since what was in the public domain there was not yet public in France. One of the books I wanted was written before 1900, and not in print since then. Yet the author died in 1969, aged 100, so the French Public Domain for her works will start in 2040.<p>Another example: &quot;As I lay dying&quot; by William Faulkner is now Public Domain in the USA. It was Public Domain in Canada from 2013 to 2023. Then the law changed, and the copyright was extended by 20 years, and reinstated for this book until 2032 — which is 70 years after the author&#x27;s death in 1962.
    • zozbot2347 hours ago
      AIUI the Canadian law change did not reinstate copyright status for works that had lapsed into the public domain, though it did extend duration of existing copyright.
  • thangalin4 hours ago
    Standard Ebooks produces exceptional quality documents. Here&#x27;s my comparison against similar projects:<p><a href="https:&#x2F;&#x2F;dave.autonoma.ca&#x2F;blog&#x2F;2020&#x2F;04&#x2F;11&#x2F;project-gutenberg-projects&#x2F;" rel="nofollow">https:&#x2F;&#x2F;dave.autonoma.ca&#x2F;blog&#x2F;2020&#x2F;04&#x2F;11&#x2F;project-gutenberg-p...</a>
  • con8 hours ago
    I built a small app that includes Standard Ebooks and Gutenberg ebooks for personal library management (and send to Kindle): <a href="https:&#x2F;&#x2F;scriptwerk.com" rel="nofollow">https:&#x2F;&#x2F;scriptwerk.com</a><p>Hopefully this makes discovery of books easier and lets people manage their libraries online - I like Calibre, but it is not great for people who are just getting started.
  • 1313ed0110 hours ago
    It would be interesting to see a combined list taking into account both US &quot;1930 or older&quot; rule and more common internationally &quot;life+70&quot; rule, to see what works have finally escaped both of those and make works a bit less unsafe to make use of, but I have not seen any list like that?
    • wongarsu8 hours ago
      If you have a list of life+70 works, filtering that for works released before 1931 is pretty straight forward<p>The more difficult part of any such list is the editorial decision which works to include. Even if we only cared about books published in English that would be thousands of books each year
    • hopelite9 hours ago
      That sounds like a good project for you then
  • apwheele8 hours ago
    Off topic, but one thing <i>I wish</i> I could do is donate a single copy epub I have the rights to to all libraries. It should be technically possible (many of the places I have lived the local library uses Overdrive).
    • Thorrez8 hours ago
      Ownership of digital files is a murky domain.<p>When you say you have the rights to it, you might only have the right to read it, not to give it to anyone else. What was the license text when you bought it?
      • apwheele8 hours ago
        Oh to be clear, these are books I own the copyright to (published through my own imprint).
        • stockresearcher7 hours ago
          This is a group that gets independent author ebooks into public libraries. Perhaps contact them to find out how to do it:<p><a href="https:&#x2F;&#x2F;indieauthorproject.com&#x2F;authors&#x2F;" rel="nofollow">https:&#x2F;&#x2F;indieauthorproject.com&#x2F;authors&#x2F;</a>
  • nairboon10 hours ago
    That&#x27;s a great project!<p>I have a hypothesis that we&#x27;re getting closer to a cultural inflection point (maybe half a decade out). With every year, more important and very high-quality cultural artifacts enter the public domain, while at the same time, many low quality artefacts are produced (... AI slop). It&#x27;ll be increasingly difficult to choose a good cultural artefict for consumption (e.g., which book to read next or which movie to watch). A very good indicator for quality is time and thus a useful filter.<p>In some years we could have the following: a netflix-like (legal variant of popcorntime) software system (p2p) that serves high-quality public domain movies, for those who like it, even with AI upscaling or post processing.<p>The same would also work for books, with this pipeline: Project Gutenberg -&gt; Standard Ebooks. At the inflection point, there would be a steady stream of high-quality formats of high-quality content, enough to satisfy the demand of cultural consumption. You wouldn&#x27;t need the latest book&#x2F;movie anymore, except for interest in contemporary stuff.
    • carlosjobim9 hours ago
      You don&#x27;t have to wait. You can borrow, purchase, or pirate any books you want.
  • kopirgan11 hours ago
    Does the movie Maltese Falcon too enter public domain?
    • robin_reala10 hours ago
      There are two (1931 and 1941), but no: a movie is its own work. It’s the same with translations.
      • kopirgan10 hours ago
        Thanks.<p>Was referring to the Bogart version.
        • robin_reala10 hours ago
          It’ll arrive in the US public domain in 2037, so a little wait.
  • ChrisArchitect5 hours ago
    Related:<p><i>Happy Public Domain Day 2026</i><p><a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=46460440">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=46460440</a><p><i>What will enter the public domain in 2026?</i><p><a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=46117112">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=46117112</a>
  • hindustanuday9 hours ago
    [dead]