jhclark Goto Github PK
Name: Jonathan Clark
Type: User
Bio: Former Carnegie-Mellon PhD student, now on Microsoft Research Translator Team. The code you'll find here is from my grad student days.
Blog: http://www.jonclark.info
Name: Jonathan Clark
Type: User
Bio: Former Carnegie-Mellon PhD student, now on Microsoft Research Translator Team. The code you'll find here is from my grad student days.
Blog: http://www.jonclark.info
A Scala port of the LDC's Champollion sentence aligner for document-aligned parallel corpora.
Azure Tools for VIsual Studio
Hadoop MapReduce training of modified Kneser-Ney smoothed language models
Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms
Microsoft Cognitive Toolkit (CNTK)
colorgcc is a perl script to colorize gcc output. I'm collecting random patches and changes
A workflow management system for researchers who heart Unix.
source crap for emacs chocolatey package
Automatically exported from code.google.com/p/failfinder
Convert a glob to a regex in Scala
A tiny utility for partitioning a group of people into smaller groups (making small meetings/discussion easy).
Just Build It
Homepage
Liberate your NLP data from previous Acts of Senseless Markup Language
A 'time'-like utility for Unix that measures peak memory usage
Moses, the machine translation system
Easy Bootstrap Resampling and Approximate Randomization for BLEU, METEOR, and TER using Multiple Optimizer Runs. This implements "Better Hypothesis Testing for Statistical Machine Translation: Controlling for Optimizer Instability" from ACL 2011.
An AdaGrad optimizer with the FastOSCAR regularizer
Jon's Basic Powershell setup (and other basics of how to setup a new Windows box)
A bash inspiried readline implementation for PowerShell
Joy Zhang's Suffix Array Language Modeling (SALM) Tooklit
Parallelize Unix commands: stdin => (parallel copies of Unix command) => stdout in the same order
Yet another thin Scala wrapper for Hadoop
Command line option parsing for scala
Python tool for monitoring status of home router, modem, and ISP connectivity. Logs and reports up time for each with email and auto-Twitter shaming built-in.
Translation Error Rate (TER)
Automatically exported from code.google.com/p/treegraft
Allow students to turn in their code via a web app.
Automatically exported from code.google.com/p/uglygenerics
zsplit (and eventually other such things)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.