From a923214f524b0fedf74c0ad392d92094cea23b88 Mon Sep 17 00:00:00 2001 From: Andreas Waidler Date: Sun, 26 Jun 2011 16:24:01 +0200 Subject: [PATCH] GPN is over. Prepared release of 0.3.0. --- README | 36 ++++++++++++++++-------------------- agg.1 | 14 ++------------ 2 files changed, 18 insertions(+), 32 deletions(-) diff --git a/README b/README index 819f733..ff1d2fc 100644 --- a/README +++ b/README @@ -16,6 +16,7 @@ networking. ## news +* 2011-06-26 agg-0.3.0 released * 2011-05-11 agg-0.2.1 released * 2011-05-10 agg-0.2.0 released * 2011-04-16 agg-0.1.1 released @@ -25,6 +26,15 @@ networking. ## changelog +### 2011-06-26 agg-0.3.0 + +* When items have conflicting names, the one with the most + recent publication date will now be stored. +* Items are now allowed to be ordered arbitrarily. +* Properties of items are now allowed to be ordered + arbitrarily. +* Fixed minor bugs in handling broken feeds. + ### 2011-05-11 agg-0.2.1 * Adjusted documentation. @@ -86,18 +96,10 @@ download-and-news-aggregation-program. wget $URL -O - | agg -### But this wastes traffic when there are no new items! - -agg quits when it assumes that there are no new feeds (see -bugs). The amount of data read too much depends on the -ratio of processing vs. download rate. - - wget $URL -O - --limit-rate=10K | agg - -### Okay. But it only works on a single feed! +### But it only works on a single feed! for feed in `cat feeds`; do - (wget $feed -qO - --limit-rate=10K | agg) & + (wget $feed -qO - | agg) & done You get the point. @@ -132,9 +134,6 @@ Again, its a news aggregator not a caching program. rather low and without much consequences (you can always follow the link). If you encounter a link that is larger than 8KiB, let me know. -* Assumes items are ordered descending by publication date - (newest items on top). Processing is stopped as soon as - an old item is encountered. * Assumes items only change if their publication date changes. Again, for simplicity. * Creation of a "sub-feed" directory if the channel @@ -143,18 +142,14 @@ Again, its a news aggregator not a caching program. * Supports only dates that have their time zone formatted as +xxxx, not as their abbreviation. * Item titles may conflict, especially if they were too - long and have been cutted. -* Items will always be (over-) written in the order they - are placed in the feed. + long and have been cutted. In this case, the item with + the most recent publication date will be stored on the + disk. * Standard mtime for items without pubDate should be now. * Sometimes, mtime of feed directory is set to current time. This seems to happen only when a "new" item is not already stored locally. If it is, the mtime is not modfified. -* agg requires that the first element of an item is either - title or description, and that the former comes before the - latter. Many feeds are not formatted this way, and agg will - abort when encounterin this issue. ## authors @@ -172,6 +167,7 @@ Again, its a news aggregator not a caching program. ## download +* * * * diff --git a/agg.1 b/agg.1 index 16682fd..ec69ad2 100644 --- a/agg.1 +++ b/agg.1 @@ -1,4 +1,4 @@ -.TH AGG 1 2011-05-11 agg "the news aggregator" +.TH AGG 1 2011-06-26 agg "the news aggregator" .SH NAME .B agg @@ -72,10 +72,6 @@ Uses fixed size buffers to simplify code. May lead cut-off news texts or links. The chances for this to happen are low and without much consequences. .IP * 2 -Assumes items are ordered descending by publication date -(newest items on top). Processing is stopped as soon as an -old item is encountered. -.IP * 2 Assumes items only change if their publication date changes. Again, for simplicity. .IP * 2 @@ -85,13 +81,7 @@ an element that had a title tag but is not an item. Supports only dates that have their time zone formatted as +xxxx, not as their abbreviation. .IP * 2 -Item titles may conflict, especially if they were too long -and have been cutted. -.IP * 2 -Items will always be (over-) written in the order they are -placed in the feed. -.IP * 2 -HTML output is formatted badly. +Item titles may conflict. .IP * 2 Standard mtime for items without pubDate should be now(). .IP * 2 -- 2.11.4.GIT