Aggressive filter for duplicate items

  • Idea
  • Updated 1 week ago
I know this has been asked for many times in the past, but has there been any movement on aggressive duplicate item removal? The new Google News feeds have severe issues with duplicate items. Can we de-duplicate based only on the headline, or by some other method? I really like your product otherwise!!

Thomas Pemberton

  • 13 Posts
  • 0 Reply Likes

Posted 2 weeks ago


Samuel Clay, Official Rep

  • 6386 Posts
  • 1450 Reply Likes
Give me a bit more context. Include screenshots of stories you wish were de-duped.

Thomas Pemberton

  • 13 Posts
  • 0 Reply Likes
Another example from today, although I'm sure you get the idea by now: https://imgur.com/g7zInOT

Feed: https://news.google.com/news/rss/headlines/section/topic/TECHNOLOGY?ned=us&hl=en&gl=US

Samuel Clay, Official Rep

  • 6386 Posts
  • 1450 Reply Likes
Are the stories empty? NewsBlur already has an aggressive de-duper on a per-feed basis, but it needs more than 100 characters in a story to check against.
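
To illustrate the trade-off being discussed, here is a minimal Python sketch of a per-feed de-duper that compares story bodies when they exceed 100 characters and otherwise falls back to the headline. This is not NewsBlur's actual code: the dictionary keys `title` and `content`, the function names, the normalization rules, and the use of an exact hash match are all assumptions made for the example. Only the 100-character threshold comes from the thread.

```python
import hashlib
import re

MIN_CONTENT_CHARS = 100  # threshold mentioned in the thread; everything else is assumed


def normalize(text):
    """Lowercase, strip punctuation, and collapse whitespace."""
    text = re.sub(r"[^\w\s]", "", text.lower())
    return " ".join(text.split())


def dedupe_key(story):
    """Hash the body if it is long enough to compare, else hash the headline."""
    content = normalize(story.get("content", ""))
    if len(content) > MIN_CONTENT_CHARS:
        basis = content
    else:
        # Hypothetical fallback: headline-only matching for near-empty stories.
        basis = normalize(story["title"])
    return hashlib.sha1(basis.encode("utf-8")).hexdigest()


def dedupe(stories):
    """Keep the first story seen for each key, preserving feed order."""
    seen = set()
    unique = []
    for story in stories:
        key = dedupe_key(story)
        if key not in seen:
            seen.add(key)
            unique.append(story)
    return unique
```

A headline fallback like this would collapse headline-only items that repeat within a feed, at the cost of also merging distinct stories that happen to share a title. The sketch only catches matches that are identical after normalization; a real de-duper may use fuzzier comparison.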

Thomas Pemberton

  • 13 Posts
  • 0 Reply Likes
With their old feed (which didn't have many dupe issues), they would have the headline and a one-paragraph blurb describing the article. With the new feed they just seem to have the headline and not much else: