Coder Social home page Coder Social logo

textmessageclassification's Introduction

What is the file meaning?
/Divide : 
	it is the eclipse project and it can be imported by eclipse. its function is Word segmentation.
/Divide/IKAnalyzer2012_u6.jar:
	the tool jar for word segmentation
/Divide/lucene-core-3.6.0.jar:
	the tool jar for word segmentation
/Divide/*_result.txt
	the output file after word segmentation
/Message.py : 
	get the feature from the training text by IG. 
/all_feature.txt: 
	The output of the Message.py with all the feature of this text.
/deal.py: 
	read the feature and training the training data. Print the Right-Rate and get testing_classify-result.
/feature.txt: 
	the feature whose distinction is biggger.
/testing_classify_result.txt: 
	the output of the deal.py with the classify result for testing file
/testing_result.txt: 
	The output of the Divide project with all the possiable word in the testing file.
/training_result.txt: 
	The output of the Divide project with all the possiable word in the training file.


Worning: 
	Default Running Enviroment Is Python 3.6, If You Do Not Have It, Please Delete The First Line In The Python File. i.e. (Del: #! /usr/bin/env python3.6)


Waht is the Runing Order?

Note: the every following step is the independent step. You can run any step you want.

1.Word Segmentation
 >Import the eclipse, and open the test.java. 
 >Change the paramter of the FileReader to the name of which text you want to do word segmentation. 
 >Change the paramter of the FileWrite to the file name you want to output. 
 >Run 

2.Get Feature
 >Change all the Input and Output file name which is you want to run
 >Change how many features you want get by the NumOfFeature
 >Run message.py

3.Get the Result
 >Change all the Input and Output file name which is you want to run 
 >Run deal.py

What is the result of the Running?

>Word Segmentation
	It get a file which is divided by word segmentation with keeping the original file form.
>Get Feature
	Get two files. one are the featrues we want to get with the form of ('word','distinction'). another are all the features of the file with the form of ('word','distinction').
>Get the Result
	Print the Right-Rate
	Get a file which is the testing result with the form 0 or 1 following the roiginal sentence for each line of the tesing file.

textmessageclassification's People

Contributors

xuechaofelix avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.