« June 2004 | Main | August 2004 »

July 2004 Archives

July 6, 2004

Feature requests

I need to add the following features into news shaker to make it more usefull:

Done (has an X if finished):
X * Delete category and all related sites
X * Delete category place all remaining sites in another category
X * Ability to have one category be a sub category of another
* 2 level categorization (related to the sub category idea above)
X * Automated "real world" testing with accuracy for all categories after a new model build. Should consist of 20 unseen and unmodeled sites that are hand categorized and then have them categorized.
X * A way to save the results from the real world testing in the database and display them.
* A way to post articles that aren't links but are actually html files into the system. (This also allows visitors to view this file.)
X * ability for people to vote for a file that is in the wrong category to be recategorized
X * Increased categorization speed
* Start a test from a new UID and then track where all the results go and view each result individually.
* Making sure that two of the same sites are never added to the database
* Checking and updating sites and getting rid of no longer existant ones.
X * Ability for users to report errors and admins to view them and delete
X * Ability for users to request categories and admins to view them and delete
X * Ability for administrator to recategorize based on users votes to recategorize
On another note i have increased accuracy on testing to the 94% overall accuracy on known documents and i am getting and average of 25% for unknown documents, which isn't horrible but i would like to do much better. I have now began to study and look into a transductive approach that i might begin to use, depending on the results of the next bits of testing.

July 19, 2004

cleaning and moving

Today I am collecting all of the source and all of the data from News Shaker. I am preparing to move everything to a new machine that is on a live connection that everyone from the outside can get to. I will then continue to add the rest of the features mentioned below. The last week has been used to add these features, while making little to no process on improving the results of the learning. I am going to talk with a few people about how to improve the learning before working with that part of the project to much more. Overall most of the code seems quite stable enough but alot needs to be done on the user interface to make it useable by normal people that aren't accustom to odd design and testing set ups. It must be cleaned up before it could ever really be used.

Web 2.0 craziness

View Dan Mayer's profile on LinkedIn


I Power Seekler
I Power Seekler

www.flickr.com
This is a Flickr badge showing public photos and videos from mayer_dan. Make your own badge here.

Creative Commons License
This weblog is licensed under a Creative Commons License.