11 comments

  • franciscop1 hour ago
    This very clearly seems like a bug either in their DMS script, or in the DMS job that they don't directly control, since CSV clearly allows for escaping commas (by just quoting them). Would love to see a bug report being submitted upstream as well as part of the "fix".
    • zarzavat1 hour ago
      CSV quoting is dialect dependent. Honestly you should just never use CSV for anything if you can avoid it, it's inferior to TSV (or better yet JSON/JSONL) and has a tendency to appear like it's working but actually be hiding bugs like this one.
      • j16sdiz1 hour ago
        Most CSV dialects have no problem having double quoted commas.<p>The &quot;dialect dependent&quot; part is usually about escaping double quotes, new lines and line continuations.<p>Not a portable format, but it is not too bad (for this use) either considering the country list is mostly static
  • aquafox22 minutes ago
    I really don&#x27;t understand why people think it&#x27;s a good idea to use csv. In english settings, the comma can be used as 1000-delimiter in large numbers, e.g. 1,000,000 for on million, in German, the comma is used as decimal place, e.g. 1,50€ for 1 euro and 50 cents. And of course, commas can be used free text fields. Given all that, it is just logical to use tsv instead!
  • rglover11 minutes ago
    Considering the scope, this could be more easily resolved by just stripping &quot;, Republic of&quot; from that specific string (assuming &quot;Moldova&quot; on its own is sufficient).
  • davecahill58 minutes ago
    I was expecting a Markdown-related .md issue. :)
  • Surac35 minutes ago
    I personaly would shy away from binary formats whenever possible. For my column based files i use TSV or the pipe char as delimiter. even excel allowes this files if you include a &quot;del=|&quot; as first line
  • cyberax16 minutes ago
    &quot;Sanitize at the boundary&quot;<p>Ah, but what _is_ the boundary, asks Transnistria?
  • vasco10 minutes ago
    The majority of countries official names are in this format. We just use the short forms. &quot;Republic of ...&quot; is the most common formal country name: <a href="https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;List_of_sovereign_states" rel="nofollow">https:&#x2F;&#x2F;en.wikipedia.org&#x2F;wiki&#x2F;List_of_sovereign_states</a>
  • shalmanese1 hour ago
    Did you really name your breakaway republic Sealand&#x27;); DROP TABLE Countries;--?
  • nivertech2 days ago
    just use TSV instead of CSV by default
  • inevletter26 minutes ago
    Huge skill issue. Nothing to see here.