After an evaluation, GNOME has moved from Bugzilla to GitLab. Learn more about GitLab.
No new issues can be reported in GNOME Bugzilla anymore.
To report an issue in a GNOME project, go to GNOME GitLab.
Do not go to GNOME Gitlab for: Bluefish, Doxygen, GnuCash, GStreamer, java-gnome, LDTP, NetworkManager, Tomboy.
Bug 577982 - Podcast shows duplicate new entries on every refresh
Podcast shows duplicate new entries on every refresh
Status: RESOLVED WONTFIX
Product: banshee
Classification: Other
Component: Podcasting
1.4.3
Other All
: Normal minor
: 1.x
Assigned To: Banshee Maintainers
Banshee Maintainers
gnome[unmaintained]
: 557985 (view as bug list)
Depends on:
Blocks:
 
 
Reported: 2009-04-05 04:34 UTC by namain
Modified: 2020-03-17 08:22 UTC
See Also:
GNOME target: ---
GNOME version: ---



Description namain 2009-04-05 04:34:42 UTC
Please describe the problem:
The Podiobooks "Down and out in the Magic Kingdom" podcast shows a duplicate new episode for every single episode every time my podcasts refresh.  I now have 8 copies of each episode in the feed reader and every time I refresh it gives me a new copy of each.

Steps to reproduce:
1. Subscribe to http://www.podiobooks.com/bookfeed/55879/364/book.xml
2. Load the podcast
3. Refresh the podcast


Actual results:
A new copy of each episode appears

Expected results:
Only one copy of each episode should ever appear

Does this happen every time?
yes

Other information:
so far I have only seen this on this particular podcast, but it did not happen to me while using Amarok or any other podcast reading application I have used with this feed.
Comment 1 Bertrand Lorentz 2009-04-05 16:39:39 UTC
I wasn't able to reproduce this problem.

Could you try to reproduce it with a new database ?
1. Make a backup of the file ~/.config/banshee-1/banshee.db
2. Delete ~/.config/banshee-1/banshee.db
3. Start banshee, it will create a new empty database
4. Try to reproduce the problem.

After the test, you can replace the new ~/.config/banshee-1/banshee.db file with your backup of it.
Comment 2 Bertrand Lorentz 2009-04-13 17:29:41 UTC
After all, I was able to reproduce the behavior you described with the URL you gave. It seems you have to wait for some time for the problem to show itself.

I guess the URL you gave is a custom feed ?

Banshee uses the title and publication date (<pubDate>) to identify each podcast item. It seems that the publication date has changed for all items, so banshee thinks all items are new items, and they are added again.
I haven't been able to check if the pubDate really changes in the raw feed file, or if it's a problem with the way banshee handles it.

I'll try to keep an eye on that.
Comment 3 Bertrand Lorentz 2009-04-14 21:53:59 UTC
My suspicion is now confirmed : I downloaded and saved the feed twice, with a bit more than a day between each download, and here's the difference between the 2 files :

$ diff -u bookrss2_1.xml bookrss2_2.xml 
--- bookrss2_1.xml	2009-04-13 18:47:36.000000000 +0200
+++ bookrss2_2.xml	2009-04-14 23:34:42.000000000 +0200
@@ -7,7 +7,7 @@
 		<description>In this podiobook: Jules is a young man, barely a 
<snip>
all.) Now it's war...</description>
 		<webMaster>webmaster@podiobooks.com (Chris Miller)</webMaster>
 		<managingEditor>editor@podiobooks.com (Evo Terra)</managingEditor>
-		<pubDate>Mon, 13 Apr 2009 12:32:02 -0400</pubDate>
+		<pubDate>Tue, 14 Apr 2009 14:50:51 -0400</pubDate>
 		<category>podiobooks</category>
 		<category>audio book</category>
 		<category>Science Fiction</category>
@@ -49,7 +49,7 @@
 			<category>audio book</category>
 			<category>Science Fiction</category>
 			<guid isPermaLink="false">2a63b6b8f5adaa1975c44cf4d59d4a51</guid>
-			<pubDate>Mon, 13 Apr 2009 12:32:02 -0400</pubDate>
+			<pubDate>Tue, 14 Apr 2009 14:50:51 -0400</pubDate>
 			<author>editor@podiobooks.com (Evo Terra)</author>
 			<enclosure url="http://www.podiobooks.com/chapter/409419/7/70048/PB-MagicKingdom-07.mp3" length="35653960" type="audio/mpeg" />

<snip>

The value of pubDate is changed for each and every item in the feed. This looks wrong to me and I'd say podiobooks has a problem in the way they generated their feed.

We could work around this by relying on the <guid> value to identify items, if it's present.
Comment 4 mxyzptlk 2009-05-24 21:02:56 UTC
I'm just curious here -- you say it's a podiobooks problem. I've noticed this with just about every podcast feed I get.  So that means Banshee is saying all the podcast providers are wrong.  That may be, but that doesn't help us much, unless we petition every provider out there to clean up their xml files -- which isn't likely.  

Why does this happen in Banshee, but it doesn't occur in apps like gPodder or Rhythmbox?
Comment 5 mxyzptlk 2009-05-24 21:04:20 UTC
(In reply to comment #4)
> I'm just curious here -- you say it's a podiobooks problem. I've noticed this
> with just about every podcast feed I get.  So that means Banshee is saying all
> the podcast providers are wrong.  That may be, but that doesn't help us much,
> unless we petition every provider out there to clean up their xml files --
> which isn't likely.  
> 
> Why does this happen in Banshee, but it doesn't occur in apps like gPodder or
> Rhythmbox?
> 

(I should note that this is relatively new behavior, at least on my end.)
Comment 6 Bertrand Lorentz 2009-05-24 21:27:29 UTC
(In reply to comment #4)
> I'm just curious here -- you say it's a podiobooks problem. I've noticed this
> with just about every podcast feed I get. 

Does the value of the pubDate tag change over time for the same <item> ?
I'm not talking about the pubDate of the whole feed, but of individual enclosures/audio files.
Could you provide a URL for one of these podcasts ?

> So that means Banshee is saying all
> the podcast providers are wrong.  That may be, but that doesn't help us much,
> unless we petition every provider out there to clean up their xml files --
> which isn't likely.

Banshee is not saying anything, it can't speak, yet ;)
As I mentioned, we probably should use the guid tag to identify items in podcasts, that's why this bug is still open.

> Why does this happen in Banshee, but it doesn't occur in apps like gPodder or
> Rhythmbox?

Maybe they're using the guid tag, or identifying items in a different way.
Comment 7 mxyzptlk 2009-05-25 17:44:13 UTC
Hi Bertrand,

I added some podcast feeds back into Banshee to check the duplicates (I'd cleared them all out last week).  Here are a few that are showing duplicate entries:

http://feeds.kcrw.com/kcrw/lr

http://podcast.msnbc.com/audio/podcast/MSNBC-MADDOW-NETCAST-MP3.xml

http://www.pbs.org/newshour/rss/podcast.xml

http://feeds2.feedburner.com/QTransmissions.xml

http://disinfo.libsyn.com/rss

I noticed that not all the entries are duplicated, but most of the recent ones are.  

I can't wait until Banshee can talk -- then it can tell us exactly what needs to be done. 
Comment 8 langos.p 2009-06-11 20:27:53 UTC
Hi all, i have got same issue. Archinux, i have tried binary package from official repo as well compiled banshee-git.

Podcast feeds:

http://radiofm.sk/podcasting/wilsonic
http://radiofm.sk/podcasting/experimental
http://radiofm.sk/podcasting/nuspirit

and propably all feeds on http://radiofm.sk/podcast/list

BTW great music, at least first 3 links :)
Comment 9 David Nielsen 2010-12-17 20:19:14 UTC
*** Bug 557985 has been marked as a duplicate of this bug. ***
Comment 10 André Klapper 2020-03-17 08:22:54 UTC
Banshee is not under active development anymore and had its last code changes more than three years ago. Its codebase has been archived.

Closing this report as WONTFIX as part of Bugzilla Housekeeping to reflect
reality. Please feel free to reopen this ticket (or rather transfer the project
to GNOME Gitlab, as GNOME Bugzilla is being shut down) if anyone takes the
responsibility for active development again.
See https://gitlab.gnome.org/Infrastructure/Infrastructure/issues/264 for more info.