vida-nyu / domain_discovery_api

The Domain Discovery Operations API formalizes the human domain discovery process by defining a set of operations that capture the essential tasks leading to domain discovery on the Web, as identified through our interactions with Subject Matter Experts (SMEs).

Home Page: https://domain-discovery-api.readthedocs.io/en/latest/?badge=latest

License: GNU General Public License v3.0

Languages: Python 83.01%, Java 15.64%, Makefile 0.69%, Shell 0.66%

Topics: domain-discovery, topic-discovery, information-retrieval, text-mining, text-classification, text-clustering

domain_discovery_api's People

Contributors

kienpt, soniacq, yamsgithub

domain_discovery_api's Issues

image selection

The method for selecting images for the snippet seems to have a problem.
See the screenshot below: there are two Google Scholar entries for different people, but both show the same picture.
image

need term summary back!

I did a query for R-Tree and many "tree removal" and "landscaping" services showed up. It would be really helpful to see the irrelevant and relevant terms (e.g., index, efficiency, spatial).
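
As an illustration of the kind of summary requested above, here is a minimal sketch that contrasts terms appearing in pages marked relevant with terms appearing in pages marked irrelevant. It is not the DDT implementation: the function names `tokenize` and `term_summary`, the document-frequency scoring, and the sample page texts are all assumptions made for this example.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return words of two or more letters (hyphens allowed)."""
    return re.findall(r"[a-z][a-z\-]+", text.lower())


def term_summary(relevant_docs, irrelevant_docs, top_n=3):
    """Return (relevant_terms, irrelevant_terms) ranked by document-frequency difference.

    Hypothetical helper: a term counts once per page, and its score is the
    number of relevant pages containing it minus the number of irrelevant ones.
    """
    rel = Counter(t for doc in relevant_docs for t in set(tokenize(doc)))
    irr = Counter(t for doc in irrelevant_docs for t in set(tokenize(doc)))
    terms = set(rel) | set(irr)
    score = {t: rel[t] - irr[t] for t in terms}

    # Ties are broken alphabetically so the output is deterministic.
    relevant_terms = sorted(
        (t for t in terms if score[t] > 0), key=lambda t: (-score[t], t)
    )[:top_n]
    irrelevant_terms = sorted(
        (t for t in terms if score[t] < 0), key=lambda t: (score[t], t)
    )[:top_n]
    return relevant_terms, irrelevant_terms


# Made-up page texts for the R-Tree query described in this issue.
relevant = ["R-tree spatial index", "spatial index efficiency"]
irrelevant = ["tree removal service", "tree landscaping service"]
relevant_terms, irrelevant_terms = term_summary(relevant, irrelevant)
```

A real summary would likely also need stop-word removal and a more robust weighting such as TF-IDF; the simple difference is used here only to keep the idea visible.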

search bug: two concurrent searches --> can't get rid of search query

I used the newest version to test a query that did not work before:
hanan samet

Having the message "searching" is great -- it is now clear the system is working.
But this particular query took too long, so I modified it to
hanan

Then DDT kept switching between "hanan" and "hanan samet". After a while it stopped blinking and reported "Query failed. Try Google".

At this point, I wanted to check the results for the "hanan" query. But when I clicked on the X to remove the query "hanan samet", the interface showed the search results and then immediately went back to the empty screen.

image
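
One common way to avoid this kind of flip-flopping between two in-flight searches is to let only the most recent query update the interface. The sketch below shows that idea under stated assumptions: the `SearchSession` class and its `start` and `finish` methods are hypothetical names for this example, and nothing here is taken from the DDT code.

```python
import threading


class SearchSession:
    """Tracks the current query so that results of older searches are dropped."""

    def __init__(self):
        self._lock = threading.Lock()
        self._latest_id = 0
        self.query = None
        self.results = None

    def start(self, query):
        """Register a new search and return its request id."""
        with self._lock:
            self._latest_id += 1
            self.query = query
            self.results = None
            return self._latest_id

    def finish(self, request_id, results):
        """Store results only if this request is still the latest one."""
        with self._lock:
            if request_id != self._latest_id:
                return False  # a newer query superseded this one
            self.results = results
            return True


session = SearchSession()
first = session.start("hanan samet")
second = session.start("hanan")

# The slow first search completes late; its results are ignored.
stale_accepted = session.finish(first, ["page about hanan samet"])
fresh_accepted = session.finish(second, ["page about hanan"])
```

With this guard, a late response for "hanan samet" cannot overwrite the state of the newer "hanan" query, whether it arrives as a result list or as a failure.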

Provide indicator of crawler progress

We need a better indicator that the crawler is working and making progress. What about showing next to "Crawler is Running" the number of pages retrieved?
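
A minimal sketch of the suggested status text, assuming the crawler can report a running flag and a page count. The function `crawler_status` and the "Crawler is Stopped" label are hypothetical; only "Crawler is Running" comes from the existing interface.

```python
def crawler_status(running, pages_retrieved):
    """Build the status label, appending the page count while the crawler runs."""
    if not running:
        return "Crawler is Stopped"  # assumed label for the idle state
    return "Crawler is Running ({:,} pages retrieved)".format(pages_retrieved)


status = crawler_status(True, 1250)
```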

can't query (potential problem with multi-term queries)

Search queries sometimes do not work. I tried "hanan samet". It did not work.
I restarted the server (did control-C and ran the script again), and it still did not work.

Is this an issue parsing multi-term queries?

See an excerpt of the log below:

MULTIFIELD TERM SEARCH
{'filter': {'or': [{'term': {'query': u'"hanan samet"'}}]}, 'query': {'bool': {'minimum_number_should_match': 0, 'should': []}}, 'fields': ['url', 'description', 'image_url', 'title', 'rank', 'x', 'y', 'tag', 'retrieved']}

172.17.0.1 - - [31/May/2017:04:46:05] "POST /getPages HTTP/1.1" 200 37 "http://localhost:8084/" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_11_6) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.36"

{'filter': {'or': [{'term': {'query': u'"hanan samet"'}}]}}

MULTIFIELD TERM SEARCH
{'filter': {'or': [{'term': {'query': u'"hanan samet"'}}]}, 'query': {'bool': {'minimum_number_should_match': 0, 'should': []}}, 'fields': ['url', 'description', 'image_url', 'title', 'rank', 'x', 'y', 'tag', 'retrieved']}
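
Two details stand out in the log above: the `should` clause is empty, and the `term` filter value contains the literal double quotes typed by the user. Whether either one causes the failure is not confirmed here. As one hypothesis only, the sketch below builds the same filter shape with the surrounding quotes stripped; `query_filter` is a hypothetical helper, not a function from the DDT code.

```python
def query_filter(raw_query):
    """Build the filter dict seen in the log, without surrounding double quotes.

    Assumption: the stored `query` field holds the query text without quotes,
    so a term filter on '"hanan samet"' (quotes included) would match nothing.
    """
    cleaned = raw_query.strip().strip('"')
    return {"filter": {"or": [{"term": {"query": cleaned}}]}}


body = query_filter('"hanan samet"')
```

If the stored value does include the quotes, this change would not help, and the empty `should` clause would be the next thing to check.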
