Coder Social home page Coder Social logo

whazzup's Issues

Fix up tokenizer

The tokenizer will sometimes create tokens like "berlusconi'", which isn't 
right. Need to also look for other bad tokenizations.

Original issue reported on code.google.com by [email protected] on 22 Feb 2011 at 8:47

Support OPML import

It should be possible to add sites simply by uploading an OPML file into 
Whazzup.

Original issue reported on code.google.com by [email protected] on 16 Feb 2011 at 1:52

Move cron service over to actual cron

Now we rerun the scheduled tasks every time the queue is restarted. Not good. 
Instead, move scheduling out to actual cron, and use send.py to start the tasks.

Original issue reported on code.google.com by [email protected] on 23 Jul 2011 at 12:30

Speed up RecalculateSubscription

This is one of the biggest obstacles to scaling. Need to analyze it to see 
where it actually spends the time. If it turns out that parsing the text is the 
time-waster then we should consider caching the word vector, for example by 
marshalling the hash into a database field.

Original issue reported on code.google.com by [email protected] on 26 Jul 2011 at 7:34

Finn.no feeds fail

Error message is "'ascii' codec can't decode byte 0xe5 in position 111: ordinal 
not in range(128)"

Original issue reported on code.google.com by [email protected] on 26 Jul 2011 at 10:17

Keep CheckFeed from blocking the queue

We can do this easily by doing it async. CheckFeed adds a feed-checking task to 
an internal in-memory queue. A sub-thread handles these tasks, and stores 
parsed feeds as in memory object structures. Then it sends a ProcessFeed 
message to the queue, which does the actual processing. Should be trivial to 
implement.

Original issue reported on code.google.com by [email protected] on 26 Jul 2011 at 6:18

Avoid reloading story list for every click

Basically, serve the list with N extra rows. For each vote, send an async 
message to the server (basically the same as now), then hide that row, and 
unhide another row.

Original issue reported on code.google.com by [email protected] on 20 Jul 2011 at 6:55

  • Blocked on: #19

Change over to queue-based approach

If the recalculation of story points as well as downloading of new stories were 
done in a queue-based approach it would make the code a lot more amenable to 
deployment in a cloud-type/service-type environment.

Original issue reported on code.google.com by [email protected] on 7 Feb 2011 at 7:19

Reduce memory footprint

The current memory footprint is excessive, mainly because both the original 
summary text for a post is stored in memory, and also the text vector.

Original issue reported on code.google.com by [email protected] on 6 Feb 2011 at 8:33

feature request: configure number of items in a page

what i see now: 
25 feed items on one page

what i'd like: 
to configure f.ex. one of 10, 15, 20, 25
(motivation: other feature request on marking all on one page as read)

Original issue reported on code.google.com by ingvald on 8 Aug 2011 at 8:32

Seems there are timezone issues

Stories often get a datetime before the present, and so are listed at "60 
minutes", then after a good while they bounce down to "n minutes" and increase 
normally after that. Presumably this is a timezone issue somewhere.

Original issue reported on code.google.com by [email protected] on 25 Jul 2011 at 2:07

Auto-detect feed format

At the moment whazzup needs to know in advance what the format of the feed it 
is loading is. This restriction needs to be removed by auto-detecting the feed 
format.

Original issue reported on code.google.com by [email protected] on 6 Feb 2011 at 7:40

Support more than one dbqueue instance

At the moment we can only run a single dbqueue instance, as we might otherwise 
get contention over the gdbm files. Need to consider how we can get around this.

Original issue reported on code.google.com by [email protected] on 24 Jul 2011 at 10:40

  • Blocking: #20

Cleanly separate UI from backend logic

At the moment the interaction between whazzup.py and the rest of the system is 
a real mix-up of different kinds of calls. Need to build a proper abstraction 
that makes this separation cleaner. Ideally it should make it possible to swap 
in and out different queueing mechanisms.

Original issue reported on code.google.com by [email protected] on 12 Feb 2011 at 12:15

  • Blocking: #9

Build a Google AppEngine version

It should be fairly easy to get this to run on Google AppEngine, which would 
make it possible to set it up as a free service for anyone to use. Would be 
cool to try it out.

Original issue reported on code.google.com by [email protected] on 12 Feb 2011 at 12:17

  • Blocked on: #7

feature request: mark all as read

what i see now:
a list of 25 feed items, where i can mark one item as read at a time.

what i'd like: 
(in addition to fewer/ configurable number of items pr page, )
ability to mark all items on a page as read in one operation.


Original issue reported on code.google.com by ingvald on 8 Aug 2011 at 8:29

Reopen broken DB connections

Occasionally DB connections break, and when they do they never get repaired 
again. Need to come up with some way to detect and fix.

Original issue reported on code.google.com by [email protected] on 20 Aug 2011 at 11:20

Sometimes hangs a long time waiting for feeds

Sometimes, when reading a feed, the worker thread hangs for several minutes 
waiting for the server to time out. This means the entire worker thread hangs 
for the same time, blocking all updates. Should redo the code so that it times 
out more quickly.

Original issue reported on code.google.com by [email protected] on 11 Feb 2011 at 6:34

Produce some statistics on queue load

One way would be to dump stats to HTML (for viewing through the web interface) 
at intervals, and to keep comparable stats for several such intervals.

Some possible measures: number of tasks in the queue, time to process a task, 
wait time for a task in the queue before it's finally processed. Showing max, 
min, average, and right now values for these measures might work.

Original issue reported on code.google.com by [email protected] on 25 Jul 2011 at 7:11

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.