Coder Social home page Coder Social logo

itm13termstatistics's People

Contributors

briquet42 avatar frufru01 avatar koeckman13 avatar medman506 avatar micsti avatar pkpt avatar rwachtler avatar vickytonal avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

itm13termstatistics's Issues

Adding new site popup not responding after wrong input

How to reproduce

On index page try to add a wrong URL to the crawl list (like 12345), this shows an error modal. After dismissing, the input popup is not responding to further user actions. After closing the popup, the "New" button is not responding anymore, so no "Add to crawlist" pop-up can be opened again until complete page refresh.

Edit REST EndPoint (/period/)

Current output

{
    "data" : {
         "amount":15321,
         "word":"die"
    }
}

Expected output

{
    "hallo" : [
         {
             date : "01.11.2015",
             amount: 5
         },
         {
             date : "03.11.2015",
             amount: 20
         },
         {
             date : "15.11.2015",
             amount: 100
         }
    ]
}

What should happen in case of re-crawling a page?

How do we handle the case of re-crawling an (on this day) already crawled page? A few ideas:

Using the old resultfile in case of

  • re-crawling within the same minute / hour
    -- the timestamp is in the filename, a check to see if the minute / hour is the same would be enough

Re-crawling in case of

  • no idea

Some input would be nice.

Parallel Crawlers aren't working as expected

If more than one crawler (= more than one website is crawled at the same time) is working at the same time, the analyzing process can't start until every crawler-thread (even the ones that don't belong to the same website) is finished.

Add an URL-Checker

Requirement

During the process of adding a new domain, a domain check should be performed which verifies that the user input is a valid URL.

NOTE: Should return a notification to the user, that the given URL is invalid. (optional)

Notification after user-triggered crawling of an existing page

There needs to be a notification when the user-triggered crawling of an already existing page is done, A simple implementation for that would be a new field in the database "is_crawling" with values "1" or "0" for true or false.

While the value is true the page is crawled again, when the value switches to false the user gets a notification that his triggered crawling is done either via e-mail or just on-page notification.

Missing REST EndPoint (Words with timestamp and amount per timestamp)

Requirement

"Pro Seite alle vorkommenden Wörter und Anzahl mit Zeiteinschränkung"

Request parameter

domainID - Numeric value
startdate - String (e.g. 01.11.2015)
enddate - String (e.g. 30.11.2015)

Expected result - (max. 10 most popular words)

[
    {
        "hallo": {
            stats: [{
                date: "01.11.2015",
                amount: "3"
            }, {
                date: "03.11.2015",
                amount: "10"
            }]
        },
        "politik": {
            stats: [{
                date: "01.11.2015",
                amount: "3"
            }, {
                date: "03.11.2015",
                amount: "10"
            }]
        }
    }
];

Show message while page is crawled

Right now the "SiteOverview" page shows "No data available in table" after adding a page to the list. It would be nice if there is a little status message that the page is crawled right now and there should be results in X minutes.

Edit page information

Display the URL of the page entry you want to edit in the head next to "Edit Entry" or as deactivated input field to give the user a hint which page he is editing right now.

The description field is nice, could be not enough of a hint tho.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.