
judgmental's People

Contributors

jcranch, pinballslouch, rjendoubi, ssrebelious

judgmental's Issues

Improve layout on smaller screens

From fjmd1 (Francis Davey):
@Judgmentals It would be great to add a contact email so I can ask you to remove the annoying white space at the LHS

The technique for adding conditionals to CSS so that the layout changes with the page width is called media queries. We can use them to remove the left nav column (which is a bit empty right now anyway) on narrower displays.

Missing indexes

These indexes don't exist right now:

NICC
UKFTT-HESC
UKFTT-TC
UKIAT
UKSSCSC

Link to legislation.gov.uk

Similar to issue 1, we could identify all mentions of Acts of Parliament and link to them on legislation.gov.uk.

Tricky enhancement: if "clause X" is mentioned in the same sentence or nearby, link to the correct clause, not just the whole Act.

Analyse and interpret BAILII links

Bailii's files contain lots of links. We should read them and process them:

  • if it is a link to another judgment on bailii, we should try to promote it to a link to the relevant judgmental page (without duplicating the efforts of the crossreference linker);
  • if it is a link to some piece of locally stored legislation, we should try to promote it to a link to our preferred relevant legislation page;
  • if it is something else, it should probably be preserved.
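
The three cases above could be sketched as a small classifier. This assumes that BAILII judgment URLs contain "/cases/" and legislation URLs contain "/legis/" in their path, which should be checked against the real files; the function name and category labels are made up for illustration.

```python
from urllib.parse import urlparse


def classify_link(href):
    """Sort a link found in a BAILII page into 'judgment', 'legislation' or 'other'."""
    parsed = urlparse(href)
    host = (parsed.hostname or "").lower()
    # Relative links (no host) are treated as pointing at BAILII itself.
    on_bailii = host == "" or host == "bailii.org" or host.endswith(".bailii.org")
    path = parsed.path.lower()
    if on_bailii and "/cases/" in path:
        return "judgment"      # candidate for promotion to a judgmental page
    if on_bailii and "/legis/" in path:
        return "legislation"   # candidate for promotion to our legislation page
    return "other"             # preserve as-is
```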

Install feedparser

feedparser is needed for legislation.py. It's kludged in at the moment but should be properly installed.

Deal with duplicate citation errors

A few judgments raise errors during the analysis stage complaining that the extracted citation is not unique.

This may be because of duplicated judgments, or because of bugs still remaining in the citation extraction code.
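
To tell the two causes apart, it may help to list which files share a citation before the analysis stage raises an error. A minimal sketch, assuming the extraction step can hand us (filename, citation) pairs; the function name is hypothetical.

```python
from collections import defaultdict


def find_duplicate_citations(extracted):
    """Group filenames by citation and keep only citations seen more than once.

    `extracted` is an iterable of (filename, citation) pairs.
    """
    by_citation = defaultdict(list)
    for filename, citation in extracted:
        by_citation[citation].append(filename)
    return {c: files for c, files in by_citation.items() if len(files) > 1}
```

If the files listed for one citation have identical text, it is a duplicated judgment; if they differ, the extraction code is the more likely culprit.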

Tagging & categorising

It would be extremely useful if judgments were tagged with keywords and categories, as has been done with PCC complaints data here: http://complaints.pccwatch.co.uk/

So a user could, with ease, view all 'defamation' or 'copyright' cases and filter by judge name, date, court, party name etc.

Fix 'Cite as:' extraction

The extraction of neutral citations from a judgment is a little buggy and also needs a better strategy for extracting citations from titles of judgments where necessary.
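
As a starting point for the title-based strategy, here is a sketch of a pattern for the simplest neutral citation shape: a bracketed year, a court code, a number, and an optional division in parentheses. It is not the existing extraction code, and it deliberately ignores forms with a word between the court code and the number.

```python
import re

# [year] COURT number, optionally followed by a division such as (TCC).
NEUTRAL_CITATION_RE = re.compile(
    r"\[(\d{4})\]\s+([A-Z][A-Za-z]+)\s+(\d+)(?:\s+\(([A-Za-z]+)\))?"
)


def extract_neutral_citation(text):
    """Return the first neutral citation found in `text`, or None."""
    m = NEUTRAL_CITATION_RE.search(text)
    if m is None:
        return None
    year, court, number, division = m.groups()
    citation = "[%s] %s %s" % (year, court, number)
    if division:
        citation += " (%s)" % division
    return citation
```

The trailing date in a title, such as "(12 May 2010)", is not mistaken for a division because the division group only accepts letters.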

Mark up ids of companies named in cases

Lots of cases mention corporations in their title, and perhaps in their body.

Sainsbury's Supermarkets Ltd, R (on the application of) v Wolverhampton City Council & Anor [2010] UKSC 20 (12 May 2010)
Gold Group Properties Ltd v BDW Trading Ltd [2010] EWHC 1632 (TCC) (01 July 2010)

We should get together with OpenCorporates to name match those, so you can easily find a list of cases about one company, and so you can hyperlink to more info about the company the other way.

Specify the open source license of the code

We don't have a license file at the moment, so the code isn't licensed. GitHub doesn't apply a license by default; it even allows hosting of non-open-source but "source visible" code.

Deal with really badly encoded files

Although we've gone to some lengths to deal with character encoding problems, some files simply declare their character encoding incorrectly and/or are not encoded with an encoding known to humans.
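
One possible last line of defence, sketched under the assumption that we have the raw bytes and the (possibly wrong) declared encoding: try the declared encoding, then a short list of likely candidates, and finally fall back to latin-1, which can decode any byte sequence. The function name and the fallback list are illustrative choices, not what the code currently does.

```python
def decode_with_fallbacks(raw, declared=None, fallbacks=("utf-8", "cp1252")):
    """Decode `raw` bytes, returning (text, encoding_actually_used)."""
    candidates = ([declared] if declared else []) + list(fallbacks)
    for encoding in candidates:
        try:
            return raw.decode(encoding), encoding
        except (UnicodeDecodeError, LookupError):
            # Wrong encoding, or an encoding name Python does not recognise.
            continue
    # latin-1 maps every byte to a character, so this never raises,
    # although the result may be mojibake.
    return raw.decode("latin-1"), "latin-1"
```

Returning the encoding that was actually used means the log can record which files lied about themselves.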

Annotations on each judgment using Disqus

We could just be lazy, and throw annotations on every judgment using Disqus.

Very quick and easy to do, some people will already have accounts, user experience is good. Will see if we get valuable comments.

Be a little bit dynamic

At present we need to rebuild the entire site every time we want to make a small modification to the pages but not the judgments, e.g. the menu, title, footer or Google Analytics snippet. This is obviously rubbish; we should probably instead generate a marked-up HTML fragment for each judgment and use server-side includes.
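
If server-side includes turn out to be awkward on our host, the same split could be done in Python: store only the judgment fragment, and wrap it in the current menu and footer when the page is produced. A minimal sketch using the standard library's string.Template; the template text and function name are placeholders, not our real page layout.

```python
import html
from string import Template

# Placeholder page shell; the real one would come from our HTML template.
PAGE = Template(
    "<html><head><title>$title</title></head>"
    "<body>$menu<div id=\"judgment\">$fragment</div>$footer</body></html>"
)


def render_page(fragment, title, menu, footer):
    """Wrap a pre-built judgment fragment in the current site chrome."""
    return PAGE.substitute(
        title=html.escape(title),  # the title is plain text, so escape it
        menu=menu,                 # menu, fragment and footer are already HTML
        fragment=fragment,
        footer=footer,
    )
```

Changing the menu or footer then means editing one template rather than regenerating every judgment.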

RSS feeds

Francis D really wants good RSS feeds of judgments.

Ideally there would be feeds by court, by search term, and so on.

Full-content RSS is vital, so that a whole judgment can be read in a news reader.
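
A sketch of what a full-content feed generator could look like, using only the standard library. It assumes each judgment is available as a dict with "title", "url" and "html" keys, which is an invented shape for illustration; the full judgment HTML goes into each item's description, where ElementTree escapes it.

```python
import xml.etree.ElementTree as ET


def build_rss(feed_title, feed_link, judgments):
    """Return an RSS 2.0 document (as a string) with full-content items."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = feed_title
    ET.SubElement(channel, "link").text = feed_link
    ET.SubElement(channel, "description").text = "Judgments: " + feed_title
    for judgment in judgments:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = judgment["title"]
        ET.SubElement(item, "link").text = judgment["url"]
        # Full content, not a summary, so it can be read in the news reader.
        ET.SubElement(item, "description").text = judgment["html"]
    return ET.tostring(rss, encoding="unicode")
```

A per-court feed is then just a call with the judgments filtered to that court.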

Deal with changing filenames

Changes to the code/original judgments could result in some judgments being moved to different locations. We should add links or redirects in this case since someone may have linked to the old location.

This is probably unlikely to occur in practice, but it should be easy to take care of.
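
Since the site is static, one easy option is to leave a tiny redirect page at each old location. A sketch, assuming the build can produce a mapping from old paths to new URLs; both function names and the `write_file` callback are hypothetical.

```python
import html


def redirect_stub(new_url):
    """Return a small HTML page that forwards the browser to `new_url`."""
    target = html.escape(new_url, quote=True)
    return (
        "<!DOCTYPE html>\n"
        "<html><head>\n"
        '<meta http-equiv="refresh" content="0; url=%s">\n'
        '<link rel="canonical" href="%s">\n'
        "</head><body>\n"
        '<p>This judgment has moved to <a href="%s">%s</a>.</p>\n'
        "</body></html>\n"
    ) % (target, target, target, target)


def write_redirects(moves, write_file):
    """Write a stub at every old path; `moves` maps old path -> new URL."""
    for old_path, new_url in moves.items():
        write_file(old_path, redirect_stub(new_url))
```

A proper HTTP 301 from the web server would be better where we control its configuration; the stub is the fallback that needs nothing but static files.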

Design a better interface to the formatter

By now, the interface to the formatter provided by run.py has a set of command-line options that is becoming bewilderingly numerous and also annoying to use. But on the other hand, these options actually form only a small part of all the options one could want.

Chris and I feel that an interactive launcher would be more helpful. But what technology should it use? Here are the possibilities I can think of:

  • Simple text-based question-and-answer. Advantages: easy to code. Disadvantages: hard to navigate, annoying, ad hoc.
  • Curses or X-based thing. Advantages: nice. Disadvantages: hard to code, possibly restrictive for platforms.
  • HTML form. Advantages: easy to code the form itself, nice. Disadvantages: can't think of a really simple way of interfacing that with Python.

Any suggestions?

Scrape HUDOC

HUDOC is the database of judgments from the European Court of Human Rights.

http://www.echr.coe.int/ECHR/EN/Header/Case-Law/HUDOC/HUDOC+database/

There is no robots.txt, and no conditions of use that I can find.

Update: there is one: "The information and texts available on the Court’s site may be reproduced provided the source is acknowledged. Users should nevertheless be aware that certain information and texts may be protected under intellectual property law, in particular by copyright."

Restore paragraph numbers

From @A_Ecclestone (Andrew Ecclestone)
@Judgmentals Why don't judgments in Judgmental have paragraph numbers? Isn't that needed for citation and intra-judgment references?

This is because the paragraph numbers are inside LI tags, which get stripped as part of our cleanup operation.

Browser toolbar to add judgments to judgmental

You'd go to a judgment on BAILII and press a button to add it to judgmental.

That wouldn't need screen scraping, but we need to consider the legal issues a bit more before doing it.

Search (using Google)

Add a search box that uses a Google Custom Search restricted to our site.

That's easier and quicker to implement, and also makes sure we do all we can to get every judgment in Google.

Generate valid HTML!

Currently, the HTML we generate does not validate; validation is clearly a desirable aim.

Improve logging on failures

If we have to abort processing any file, for example in many cases if exceptions of various sorts are raised, a message is written to a log.

It is well worth increasing the quality of those messages to provide more information about what's going wrong.
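
For instance, each log entry could record the file name, the exception type and the full traceback, and the run could finish with a summary of what failed. A sketch using the standard logging module; the logger name, `process_all` and the `convert_one` callback are invented for illustration and do not reflect the current structure of the code.

```python
import logging

logger = logging.getLogger("judgmental.convert")


def process_all(filenames, convert_one):
    """Convert each file, logging any failure with its traceback.

    Returns a list of (filename, exception_type_name) for the failures.
    """
    failures = []
    for filename in filenames:
        try:
            convert_one(filename)
        except Exception as exc:
            # logger.exception logs at ERROR level and appends the traceback.
            logger.exception(
                "Aborted %s: %s: %s", filename, type(exc).__name__, exc
            )
            failures.append((filename, type(exc).__name__))
    return failures
```

Counting the failures by exception type afterwards gives a quick picture of which bug is costing us the most judgments.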

Upgrade to Python 2.6?

Obviously, we're limited by what runs on the server. But it would be nice to take advantage of more modern Python features. Benefits include the following:

  • It would give us the "multiprocessing" library for running conversions in parallel across processes, thus speeding up on-server conversions;
  • We could employ the "with" idiom to safely and transparently open and close the database and the multiprocessing pool;
  • Um, anything else?
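
To illustrate the "with" idiom for the database, here is a sketch that assumes an SQLite database accessed through the standard sqlite3 module; the table and function name are invented. It is written for current Python, but "with" and contextlib.closing are both available from 2.6 without a __future__ import.

```python
import sqlite3
from contextlib import closing


def store_titles(db_path, titles):
    """Insert titles and return the row count; the connection is always closed."""
    # closing() guarantees conn.close() even if an exception is raised.
    with closing(sqlite3.connect(db_path)) as conn:
        # Using the connection itself as a context manager commits on
        # success and rolls back if the block raises.
        with conn:
            conn.execute("CREATE TABLE IF NOT EXISTS judgments (title TEXT)")
            conn.executemany(
                "INSERT INTO judgments VALUES (?)", [(t,) for t in titles]
            )
        return conn.execute("SELECT COUNT(*) FROM judgments").fetchone()[0]
```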

Speedup: best_filename

In convert.py, we (I) rather dumbly pass the court name to best_filename and then use Levenshtein to find the abbreviated name. This is done for every judgment.

We should put the abbreviated names into the courts table instead.
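
Until the courts table carries the abbreviations, a cheap interim fix might be to do the fuzzy match only once per distinct court name. The sketch below uses functools.lru_cache for that and difflib from the standard library as a stand-in for Levenshtein; the mapping and the function name are placeholders, not the contents of our courts table.

```python
import difflib
from functools import lru_cache

# Placeholder data; the real mapping would come from the courts table.
COURT_ABBREVIATIONS = {
    "United Kingdom Supreme Court": "UKSC",
    "England and Wales High Court": "EWHC",
}


@lru_cache(maxsize=None)
def abbreviation_for(court_name):
    """Return the abbreviation for a court name, or None if nothing is close.

    The fuzzy match runs at most once per distinct name because results
    are cached.
    """
    if court_name in COURT_ABBREVIATIONS:
        return COURT_ABBREVIATIONS[court_name]
    close = difflib.get_close_matches(
        court_name, list(COURT_ABBREVIATIONS), n=1, cutoff=0.6
    )
    return COURT_ABBREVIATIONS[close[0]] if close else None
```

With only a few dozen courts and many thousands of judgments, nearly every call becomes a cache hit.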

Improve output page formats

We need to better utilise the metadata we have collected.

There are several things we have which we are not using at all:

  • crossreferences in to each page
  • crossreferences out of each page
  • the original bailii URL

And there are probably prettier and better ways to display the things we are using.

Once we have nice indexing pages (an index for each court, etc), we can use the metadata to generate URLs to the appropriate page.

What we need, really, is an improved HTML template for the page.
