A few years ago I joined a CTO WhatsApp group. I think I was the 14th person there or so. These days this group has more than 270 members. It is an amazing group, with a lot of valuable content.
People have repeatedly asked for an easily searchable index of the group’s history, and at some point I started writing one. I spent a couple of hours parsing the WhatsApp group history, set up an Elasticsearch server on AWS and uploaded the messages there. I spent a bit more time trying to set up Cognito to work, but I ran out of time to spend on that project, and then the log4j vulnerabilities came and I killed that Elasticsearch server.
One nice side effect of the scripts I wrote was that I could get the list of top-posters in a WhatsApp group according to its history which is pretty useful when you’re group admin. I’ve already used it several times. Since I’ve been asked to run it by other group admins, I thought I’d share it openly. The conversation analysis code is still there but mostly unused.
So here it is: https://github.com/lorg/whatsapp_history . You are welcome to use it anyway you see fit.
Have fun, and let me know if you found it useful :)