remix logo

Hacker Remix

Show HN: News Minimalist – News ranked by significance

71 points by yakhinvadim 2 days ago | 52 comments

Hey HN! I'm the author of News Minimalist — a news aggregator where all news is ranked by significance on a scale from 0 to 10.

The project was born out of personal pain — I wanted a way to read only significant news, like major humanity milestones, or historical political events, filtering out all the celebrity gossip and smartphone releases. But I couldn't find a way to do that — everywhere I looked, the news was ranked by popularity, coverage, or relevance, not significance.

I first tried to solve the problem in the beginning of 2023 with GPT-3 (the top model at that time) by asking it to estimate the significance of some news stories. The results were painfully bad — for some reason, the model preferred tragic, personal stories, completely missing the essence of what makes the news significant. No amount of prompt engineering could fix that.

But it all changed in March 2023 when GPT-4 came out. The scores it gave made much more sense. After a month of work, the first version was ready. News Minimalist had its first successful Hacker News post (https://news.ycombinator.com/item?id=35795388), and I realized that a lot of people had the same problem I had.

I've been working on improving the project ever since. As probably most tech founders, I spent too much time on technical improvements, completely ignoring marketing. But I think that work paid off, and I'm finally satisfied with the scores it gives.

The results are posted on the site: https://www.newsminimalist.com/

Let me know what you think!

Vadim

marc_abonce 1 day ago

Semi-related: Wikipedia's homepage also contains a very minimalist, manually curated news section with only major world events:

https://en.wikipedia.org/wiki/Portal:Current_events

Although that's perhaps way too minimalist?

yakhinvadim 1 day ago

Wikipedia current events page was actually one of the reasons for creating this project!

I was disagreeing a lot with their selection of news, for example one of their recent entries is:

"Two people are killed and eleven others are injured when a bus flips on its side on a highway near Prenzlau, northeast of Berlin, Germany."

My system gave it a significance score of 1.8, so similar news should never get to the main page: https://www.newsminimalist.com/articles/two-dead-and-four-in...

jdthedisciple 1 day ago

Isn't significance heavily subjective though?

A lot of the most signficant stories are political, for example, which someone may have no interest in.

I have had this same idea in the past, tuning to my personal interests.

yakhinvadim 1 day ago

Good point! I actually have this exact question on about page [0], I'll copy my thoughts from there:

I separate significance from importance (or relevance).

Importance is subjective. News about the health of my family members is important to me, but it is not significant to the world.

Significance is objective. It's about how much the event affects humanity as a whole.

[0] https://www.newsminimalist.com/about

redeux 1 day ago

> Significance is objective. It's about how much the event affects humanity as a whole.

I don’t agree with that, at least not in the present. We only know what’s truly significant when we reflect on history. There are very few things we can be certain are significant in the present. Climate change is likely one, but the US debt ceiling and the war in Ukraine don’t seem as likely to me, at least not in the human scale. There are also events that happen that don’t appear significant in the present but will be hugely significant in the future.

yakhinvadim 1 day ago

I actually agree that "true" significance can only be estimated in hindsight.

My goal with this project is not to get "true" significance but to have a setup that gets you 90% of the way there: an automated system that finds events that are likely to affect large groups of people or major systems and filters out most of everyday noise.

There will always be false-positives and false-negatives, but I think it's a good starting point and it should slowly get better as models get smarter.

viciousvoxel 1 day ago

Significant Others just got downgraded to Important Others

jdthedisciple 1 day ago

Makes sense that way. I think you nailed the execution. Good job!

dvh 1 day ago

I've been using rss feed for few months but recently it became borderline useless. For example here is grep of pubDate:

    $ wget -qO - https://rss.beehiiv.com/feeds/4aF2pGVAEN.xml | grep pubDate
    <pubDate>Thu, 16 Jan 2025 17:56:47 +0000</pubDate>
    <pubDate>Thu, 16 Jan 2025 17:56:47 +0000</pubDate>
    <pubDate>Mon, 13 Jan 2025 17:22:34 +0000</pubDate>
    <pubDate>Sat, 11 Jan 2025 16:42:01 +0000</pubDate>
    <pubDate>Thu, 09 Jan 2025 17:52:38 +0000</pubDate>
    <pubDate>Fri, 03 Jan 2025 16:19:25 +0000</pubDate>
    <pubDate>Sat, 28 Dec 2024 17:44:35 +0000</pubDate>
    <pubDate>Wed, 18 Dec 2024 16:44:00 +0000</pubDate>
    <pubDate>Mon, 16 Dec 2024 18:01:56 +0000</pubDate>
    <pubDate>Tue, 10 Dec 2024 18:00:55 +0000</pubDate>
    <pubDate>Fri, 06 Dec 2024 17:43:37 +0000</pubDate>
    <pubDate>Wed, 04 Dec 2024 16:38:38 +0000</pubDate>
    <pubDate>Sun, 01 Dec 2024 18:57:43 +0000</pubDate>
    <pubDate>Fri, 29 Nov 2024 18:07:14 +0000</pubDate>
    <pubDate>Sun, 24 Nov 2024 15:26:59 +0000</pubDate>
    <pubDate>Thu, 21 Nov 2024 17:25:22 +0000</pubDate>
    <pubDate>Tue, 19 Nov 2024 17:47:19 +0000</pubDate>
    <pubDate>Fri, 15 Nov 2024 17:28:00 +0000</pubDate>
    <pubDate>Tue, 12 Nov 2024 17:44:08 +0000</pubDate>
    <pubDate>Fri, 08 Nov 2024 17:22:01 +0000</pubDate>
    <pubDate>Wed, 06 Nov 2024 16:43:10 +0000</pubDate>
So since 6th november there were only 21 articles. Longest streak was 10 days and common is 3 days without any news whatsoever.

yakhinvadim 1 day ago

This rss is not exactly an rss with articles from the main page, but a newsletter I send manually every few days once enough significant stories happen to warrant an email. Each newsletter issue includes 2-5 main articles and 3-7 trending articles.

https://newsletter.newsminimalist.com/

jamie_ca 1 day ago

As an RSS user, I would love an RSS of the main page content, one entry per story over 5.5 is a perfectly reasonable baseline.

Also: It'd be great if you had a feed tag in your HTML head, so RSS readers could pick it up straight out of your homepage URL instead of needing to manually hunt for the right RSS link.

kevincox 11 hours ago

100% the current implementation is "RSS as would be desired by newsletter lovers", but there is already the newsletter for that. If I want batching or similar my reader will handle that, I think it would be best just to have items as they happen appear on the feed.

yakhinvadim 1 day ago

Ah, I didn't know it was a thing! I'll add it to HTML head.

voisin 1 day ago

I second this. This would be a great feature

dvh 1 day ago

So where is the rss feed of most important news per day?

yakhinvadim 1 day ago

I know it's not going to be popular, but to cover the cost of running ChatGPT on that many articles, I made it a part of a premium subscription: https://www.newsminimalist.com/premium#rss

DrPhish 1 day ago

Do you need realtime results, or is an ongoing queue of article analysis good enough? Have you considered running your own hardware with a frontier MoE model like deepseek v3? It can be done for relatively low cost on CPU depending on your inference speed needs. Maybe a hybrid approach could at least reduce your API spend?

source: I run inference locally and built the server for around $6k. I get upwards of 10t/s on deepseek v3

PS: thank you for running this service. I've been using it casually since launch and find it much better for my mental health than any other source of news I've tried in the past.

yakhinvadim 1 day ago

Thank you so much! Always glad to see long-time readers.

There was a period when I considered switching to an open-source model, but every time I was ready for a switch, OpenAI released a smarter and often cheaper model that was just too good to pass up.

Eventually I decided that the potential savings are not worth it in the long term - it looks like LLMs will only get cheaper over time and the cost of inference should become negligible.

DrPhish 1 day ago

Thanks for the reply! This is perhaps not so much a Hacker News type question since this place is very VC focused, but have you considered publishing any papers on your system? I think it would make a fascinating and valuable bit of research.

Or, even farther off the deep-end: have you considered open-sourcing any old versions of your prompts or pipeline? Say one year after they are superseded in your production system?

yakhinvadim 1 day ago

I don't oppose these ideas, but it's a matter of priorities. There's just many other features and improvements to implement that seem more valuable.

yaj54 1 day ago

Nice work. Subscribed.

I had a very similar idea a while back. I wanted to rank news by "impact" which might be more concrete than "significance."

For an LLM prompt, it would be something like:

"estimate the number of people who's lives that will be materially changed by this news." and "estimate the average degree of change for those impacted."

Then impact is roughly the product of those two.

Additionally, I want a version that is tailored to me specifically "estimate the degree of change this will have on my life." + context of my life.

Tangentially, I've found that getting ratings out LLMs works better when I can give all options and request relative ratings. If I ask for rankings individually I get different and less good results. Not enough context length to rate all news from all time in one go though. Any thoughts on that? Maybe providing some benchmark ratings with each request could help? Something I'm exploring.

yakhinvadim 1 day ago

What you're describing is super close to the first version I had!

In the beginning I had 3 parameters: scale (number of people), magnitude (degree of change for those impacted) and additionally potential (how likely is this event to trigger downstream significant events).

The point behind including potential was to separate these two events:

1) A 80 year old dies from cancer 2) An 80 year old dies from a new virus called COVID

This worked roughly well but I kept adding parameters to improve the system: novelty, credibility, etc... The current system works on 7 parameters.

---

I never attempted to give LLM all options and rank them against each other.

1) as you said, for me 20k articles is just too much to fit into context window. Maybe some modern LLMs can handle it, but it wasn't the case for a long time, and I settled on current approach.

2) I don't want the "neighbors" to affect individual article ratings. With the current system I am able to compare news spread over months, because they were all rated using the same prompt.

3) I intentionally avoided giving AI examples, like "evaluate event X given that event Y is 7/10". I want it to give scores with a "clear mind" and not be "primed" to my arbitrary examples.