Nice SQLi vulnerability you got there ;-)<p>> <i>making this project was the most fun I have had in some time haha!</i><p>> <i>sorryyyyy for vibe coding it though. Peace. I am only human after all […]</i><p>Well, yes, of course the whole app was written by an LLM. I’m not surprised at all.<p>---<p>Request:<p><pre><code> POST /?user=play&add_http_cors_header=1 HTTP/1.1
Host: play.clickhouse.com
Content-Type: text/plain;charset=UTF-8
User-Agent: Mozilla/5.0 (KHTML, like Gecko) Chrome/109.0.5414.120
Accept: */*
Origin: https://serjaimelannister.github.io
Referer: https://serjaimelannister.github.io/
SELECT username, total_words, global_rank, total_active_users,
concat(toString(global_rank), ' / ', toString(total_active_users)) AS placement,
round(100 * (1 - (global_rank / total_active_users)), 2) AS percentile
FROM (
SELECT by AS username, sum(length(splitByWhitespace(text))) AS total_words,
rank() OVER (ORDER BY sum(length(splitByWhitespace(text))) DESC) AS global_rank,
count(*) OVER () AS total_active_users
FROM hackernews_history WHERE type = 'comment' AND deleted = 0 AND notEmpty(by)
GROUP BY by
) WHERE username = '' OR 1=1;--' FORMAT JSON
</code></pre>
Response:<p><pre><code> This message is too large to display</code></pre>
It’s funny how I spend so much time on HN, yet couldn’t point out a single username (that I don’t know IRL) besides dang.<p>This is one reason I feel an odd disconnect (anonymity?) with HN that isn’t felt on other social platforms I’ve been a part of. Those often have avatars or some other visual form of recognition that helps put a “face” to a name.<p>I’m not sure if that’s a good or bad thing, but I definitely think it’s intentional.
Reddit was originally designed this way, and HN sort of accidentally copied it. Back then, we always said, "content is first". We wanted people to get upvotes for their content, not for who they were.<p>I prefer it that way.
As an old lag there is a fairly large number of names which I recognise on sight, quite a few of them from the old days of /r/programming and even the main reddit. I'd have trouble listing many of them completely unprompted though.
I've had these same opinions for years. It is an under appreciated social network of some of the top minds and quality comments.<p>I've been collecting a long list of ideas on what you're describing. Thanks to AI encouraging me to really dive in and use it, I've been quietly working on something for what you're describing.<p>First step is to improve the HN UX a tiny bit and flesh out a framework for how to code it. Next will add some interesting social features I've been brewing on. Why can't I easily follow someone?<p>Open source. GPLv3. It isn't perfect, but this is not AI vibe slop, and there are lots of tests from day one. I want to make this sustainable over a long period of time and become genuinely useful to a community that I've gotten a lot out of.<p>Note, the chrome store is really slow at getting releases out (or I'm too fast), best to install from github releases. It is also buggy and I'm fixing and improving things as fast as I can.<p><a href="https://orangejuiceextension.github.io/" rel="nofollow">https://orangejuiceextension.github.io/</a>
Another thing is that lacking the freedom to delete our own comments here, I assume many people treat their account as only a throwaway identity.
I miss DoreenMichele. She always added thoughtful perspectives.<p>Looks like she’s actively writing at <a href="https://califmichele.blogspot.com/" rel="nofollow">https://califmichele.blogspot.com/</a> and <a href="https://doreenmichele.blogspot.com/" rel="nofollow">https://doreenmichele.blogspot.com/</a> but has departed HN.
I'm in the top 1.5%, even though I hardly have written anything here, and the comments are full of similar anecdotes. I guess there's a _ton_ of people lurking, and the active community is actually quite small. I find that quite surprising.
I am at 0.77% with only 73k words. And the top commenter is at 4 million. Is this website counting people with 0 words too?
Long tail. I wouldn't call them lurking they do what satisfies their itch.
I prefer the word “reading”, too.<p>I’m reading the comments almost every day, and I write them only if I think I have a point, insight that the other 50 people in the comments didn’t have.
I'm also naturally curious about the byte count --- using the accepted standard of 5 for words to characters, and since I almost never post anything but ASCII, I've been writing approximately 1.25KB per day here; or just over 5.5MB worth of text so far. Considering that English text compresses very well, and using ~20% as a rough ratio, this means that all ~1.2M words of my comments here, compressed, would still fit on one 3.5" floppy disk.
How does it count so fast? Clickhaus preloaded dataset?<p>Top 0.023%, I was surprised! I usually keep it pretty short here, and my account isn't old.
Cool! Just a thought: instead of having to query the Clickhouse cluster whenever a client clicks "View Top 1000 Leaderboard" (which could cause a lot of load), it might be useful to instead fetch the top 1000 every hour (day?) and display the top 1000 as a static list.
Ooh I cracked the top 500. I’m at about 475k words.<p>Took me a few tries to find my user since I wasn’t expecting the case sensitivity.<p>Thanks for this. Another book you could add for comparison purposes would be James Joyce’s Ulysses. Or I guess the unabridged The Stand by Stephen King would be good too.<p>Ooh The Stand (unabridged) is estimated at 473,000 words! I wrote The Stand in comment length. Wow.
"No, I don't think I will" - I already have a sense of how much time I've spent here.
This is pretty cool! This week I was just thinking of vibe coding something with my HN profile as well (e.g, analyze how my writing has changed over the decade-ish of being on here).<p>Also, 95k words written on here apparently. Cool to know haha.
> Top 0.41%<p>If only any of that was useful!<p>On a side note though there is (maybe intentional) case sensitivity? Can't remember how hn usernames work.
So many of these names I feel I know them, but I don't know them, personally.<p>I know them, by tone. I read his/her take on the topic. Turns out you don't need to see any faces or body ratios of any kind to connect with people.<p>Thanks for keeping HN 'stable/sane'!
Two takes:<p>* never meet your heroes/heroines<p>* when you meet f2f with people you've known for decades online, prepared to be whelmed, under or over, depending.<p>People IRL are very often not what you projected. I learned this from UK mailing list interactions over 40 years ago.
for an account i created in june 2024, top %0.54 is a lot. I need to spend less time on HN. more than that, I need stop typing walls of text, has to be annoying to readers! :)
I did rally simple frequency analysis based on corpus source a while ago and the results were super clear, you can tell a corpus by its frequency fingerprint. I wonder if something similar to this could fingerprint bot accounts?
So if we find somebody who uses one-word posts like "interesting" on every comment, have we unmasked .. he who mus(k)t not be named?
Look on my <i>[prolix]</i> words, ye Mighty, and despair!
Huh. In the top 1500, with approximately one GoT worth of text in ~17 years.<p>Also, I recognize four of the top five users as prolific commenters, but dragonwriter doesn’t ring a bell at all. Maybe they frequent all the threads that I don’t.
I feel like a perfect realization Goodhart's Law is about to happen to move up our rankings.
Very cool. I would point out that the search is case-sensitive, and with that being said I'm not sure if HN usernames are case-sensitive.
Global Rank 7089 |
World Count 62,677 |
Percentile Top 0.92% |
Game of Thrones Volume 0.21<p>This would be pretty cool for other sites. My Reddit stats are probably way worse.
Oh my.<p>> Global Rank
> 385 / 774235<p>> Word Count
> 509,412<p>> Top 0.05%<p>I don't know if I'm too long-winded or I comment too much or both. Good to know I'm in the top 400 regardless.
Click [here] to train a 6B model with just your words ...
Neat! Over 300,000, putting me in the top 1,000.
Oh nice!! I am 1935. I am thinking of writing less comments haha to get once to 1984 so that I can say "literally 1984" xD. I mean it would be funny but I will still write comments haha.<p>man I really love this community yes its has its flaws and everything but man do I love it.<p>I don't write blogs or anything because I feel like many people who are really respectable can come and read my comments in here and give me suggestions and help me learn and other things, Its really just a lovely community! (with sometimes heated discussions) but although I must say that the feeling of community can be a sine wave (sometimes up or down imo) but still I just feel this bond to the community :><p>> Oh nice!! I am 1935. I am thinking of writing less comments haha to get once to 1984 so that I can say "literally 1984" xD.<p>> man I really love this community yes its has its flaws and everything but man do I love it.
Yeah, I'm not sure how I feel about it. I love HN but maybe I need another hobby or three.
It would be fascinating to see a word to karma ratio. (Mine would be incredibly low).
You can see the karma of the people with the 11th-100th highest karma at <a href="https://news.ycombinator.com/leaders">https://news.ycombinator.com/leaders</a> . Here are the 60 of those people who are also in the top 1000 on the word count list, sorted by increasing word to karma ratio.<p>Columns are words/karma, words, karma, name.<p><pre><code> 3.5 308431 88008 mooreds
4.1 307127 75567 stavros
4.3 314850 73503 minimaxir
4.3 575909 133629 ColinWright
4.5 429663 96135 walterbell
5.5 320283 58461 wallflower
5.9 463540 78823 paxys
6.1 298839 49063 paulpauper
7.1 450573 63823 cperciva
7.1 685484 97028 simonw
7.2 415385 57466 mpweiher
8.8 435188 49452 Waterluvian
9.4 912601 97058 steveklabnik
9.5 484782 51089 pavlov
9.5 514233 54028 nkurz
9.6 738986 76912 jedberg
9.9 538580 54533 pavel_lishin
10.5 523765 50113 wmf
10.5 562066 53697 kibwen
11.1 649587 58521 pmoriarty
11.2 554531 49316 petercooper
11.3 626706 55613 sp332
11.3 674598 59635 tyingq
11.3 997305 88154 ceejayoz
11.4 774926 67711 davidw
11.8 892827 75358 hn_throwaway_99
12.5 652216 52309 duxup
12.5 627078 49987 Someone1234
12.6 1999366 159310 Animats
13.3 1168121 87843 userbinator
13.5 1425286 105817 pjc50
13.5 771686 56994 lisper
14.1 1143293 81306 crazygringo
14.2 698215 49002 JoshTriplett
14.3 867103 60494 saagarjha
15.4 1628467 105619 toomuchtodo
16.2 787659 48722 amelius
16.3 1285245 78792 WalterBright
16.5 1058282 64324 ryandrake
16.6 892312 53904 ksec
18.8 1038783 55136 bane
19.8 1950935 98675 anigbrowl
19.9 1355066 67997 masklinn
20.0 2510303 125350 pjmlp
20.2 2110424 104359 PaulHoule
20.3 2251499 110917 ChuckMcM
20.5 1497782 73213 jrockway
21.0 1168930 55722 btilly
21.9 2747766 125470 rayiner
22.2 1822427 82045 nostrademons
22.4 1319812 58825 wpietri
24.7 1275113 51702 brudgers
27.6 3131449 113256 TeMPOraL
29.7 2701314 90987 jerf
30.1 2696913 89718 coldtea
31.7 1911252 60198 Retric
37.6 4785959 127149 dragonwriter
38.5 2130838 55318 derefr
39.3 2583878 65748 dredmorbius
42.5 2141376 50383 tzs</code></pre>
Hey Hackernews, You can read my previous comment <a href="https://news.ycombinator.com/item?id=46827731#46828331">https://news.ycombinator.com/item?id=46827731#46828331</a> where I was suddenly writing until I realized that on Hackernews I have written way too many words.<p>I then got the idea of actually figuring out how many. Then I first wanted to try out algolia but then later, I found out about clickhouse and how it had a play and the api for playing is so simple, I am definitely gonna make more projects on top of clickhouse play for HN (seriously my mind got blown because I was assuming that the browser -> api was gonna be hard but it seriously wasn't)<p>Then decided to think to write a github page about it for other people as well.<p>Anyways, this was one of the most fun project I had. So it turns out that I personally have written 0.64 Game of thrones words in Hackernews itself.<p>Dang has written 11.15 Volumes equivalent to game of thrones which is actually really crazy.<p>When I searched dang I was shocked haha. Anyways Dang, If you are reading this, I know that we all like to talk about how moderation of HN has issues but seriously man, the amount of efforts you put in is really lovely & respectable. We all love you.<p>I still feel like there are some issues where people flag anything they dislike which can be frustrating and other things but that still doesn't really impact the moderation and the moderation team (dang) is pretty awesome in my opinion even if the website does have this flaw in my opinion but Hackernews is one of the best websites man!<p>Dang today's your day! We can discuss the issues of flagging and others some other day, Have a nice day now!<p>(Also a little side fact but I picked game of thrones because my name of github is SerJaimeLannister because I was watching game of thrones in my brother's dorm room once in his college room and I literally just thought one or two episodes and started watching from s4 or something and then literally the second I got home, I binge watched Game of thrones till end and then s1 s2 but I think that I haven't watched some seasons I think s3 iirc more but still I loved the show so much and I think I had lost my old github account and naming is always hard especially in programming so picked SerJaimeLannister but this is the reason why I picked the novel equivalent to be game of thrones!)
Holy heck. The first person I looked up was tptacek, who happens to be #2 in the global rank. 4.3 million words!<p>I'm nowhere near that (~125k words), but for many of us, it's a good part of our life's corpus. :)
So basically I was making this for myself but then searched dang (I first searched myself, then pg then dang)<p>So Dang once again,Thank you dang for your moderation and moderation efforts!<p>Hope my project can make you smile or just about anything haha. Cheers & also let me know how funny is the cat video. (wanted to prove I am human because literally people sometimes comment how I sound like AI & sometimes accuse me of such in HN which is yeahh.. beep boop)
Heh. Here's a thread where the most verbose commenters come and write even more. I haven't written nearly as much as I thought: 2,410th out of 774,235 users, 159,634 words, Top 0.31%.<p>A few years ago, I exported my HN and reddit comments along with my personal blog and private notes into a SQLite database. It was millions of words. I had a vague plan of pulling out long, insightful bits and editing them together into a book of essays. I also thought it would be cool to be able to look up my previous thoughts on a topic. Neither ended up happening.<p>I've been meaning to do the same thing to train an LLM, but I'm not sure I particularly need a digital version of me. Though it would be interesting to ask it to write a book for me in my own style.<p>In theory, it'd be the best book I have ever read.
needs a 1/(words/comment-karma) metric!
It's very useful project.
s/Prolificacy/Verbosity/
I'm genuinely concerned not finding my handle in the leaderboard will subconsciously have me believing I don't have an HN problem.