- ๐ผ Works at Microsoft
- ๐ฌ Ask me anything!
0dust / resumefilter Goto Github PK
View Code? Open in Web Editor NEWExtracting relevant information from resume using deep learning.
Extracting relevant information from resume using deep learning.
LSTM's are pretty old now, let's try to use transformer models for extracting information.
Hello ,
I am getting the issue with Tkinter , when we run python create_training.py in the console.
I am running in amazon ec2 linux instance and it don't have GUI.
Any alternative way we can do it?
Thanks,
Suhas
Having Gitpod will make it easier to checkout the project rather than first having to mess up with installation.
Hi @0dust ,
I had clone your repository and tried to execute the problem i didn't find any issues or dependency related to the code but i just made some changes in training data added 1000 sample resumes and did training and i changed to 150 epochs i got an accuracy of 0.5 and then i did the prediction of some sample resumes but i haven't the related info like name,email,mobile no,skills ,experience like may i know the issue what i am facing..
Note: while training process its showing like Train on 51 samples, validate on 22 samples
may i know the reason and can you help me in this to get the related info like experience from thge resume ...
Thanks and Regards,
Manikantha Sekhar...
Hi my name is Priya.
I am new to Python. When I try to run this program,
after running the Create_training where is the file stored? cant find Trained_Model folder.
My CVs are getting extracted and after running Create_training and the new window opens for every CV. Do i need to close all windows?
And i am getting a H5py warning when i run Start_training.py. below is the error . Please help
2020-08-12 12:54:18.552284: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library cudart64_100.dll
C:\Users\priya\anaconda3\envs\resumefilter\lib\site-packages\h5py_init_.py:40: UserWarning: h5py is running against HDF5 1.10.5 when it was built against 1.10.4, this may cause problems
'{0}.{1}.{2}'.format(*version.hdf5_built_version_tuple)
Warning! HDF5 library version mismatched error
The HDF5 header files used to compile this application do not match
the version used by the HDF5 library to which this application is linked.
Data corruption or segmentation faults may occur if the application continues.
This can happen when an application was compiled by one version of HDF5 but
linked with a different version of static or shared HDF5 library.
You should recompile the application or check your shared library related
settings such as 'LD_LIBRARY_PATH'.
You can, at your own risk, disable this warning by setting the environment
variable 'HDF5_DISABLE_VERSION_CHECK' to a value of '1'.
Setting it to 2 or higher will suppress the warning messages totally.
Headers are 1.10.4, library is 1.10.5
SUMMARY OF THE HDF5 CONFIGURATION
=================================
HDF5 Version: 1.10.5
Configured on: 2019-03-04
Configured by: Visual Studio 15 2017 Win64
Host system: Windows-10.0.17763
Uname information: Windows
Byte sex: little-endian
Installation point: C:/Program Files/HDF5
Is there any way to store the output obtained from predict.py. At present we can see the output in the console. Please help me in doing this
@0dust When I want to run create_training_data I get a ModuleNotFound Error that says that docx doesn't exist. When I want to install:
PackagesNotFoundError:
The following packages are not available from current channels:
Current channels:
Hi,
I am unable to go down to the resume to tag them since i am unable to drill down. I am only able to see till line 22 of each resume since upto line 22 is only be adjusted into my monitor screen.
Please suggest me the solution.
import os
import sys
from tkinter import *
import tkinter.font as tkFont
import pandas as pd
sys.path.append(os.path.dirname(file))
sys.path.append(os.path.abspath(os.path.join(os.path.dirname(file), '..')))
from read_input import read_docx_and_pdf
.
.
.
.
And then last is
if name == 'main':
main()
python3 start_training.py
2024-04-24 18:27:39.079680: I tensorflow/core/util/port.cc:113] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different computation orders. To turn them off, set the environment variable TF_ENABLE_ONEDNN_OPTS=0
.
2024-04-24 18:27:39.079914: I external/local_tsl/tsl/cuda/cudart_stub.cc:32] Could not find cuda drivers on your machine, GPU will not be used.
2024-04-24 18:27:39.082191: I external/local_tsl/tsl/cuda/cudart_stub.cc:32] Could not find cuda drivers on your machine, GPU will not be used.
2024-04-24 18:27:39.110750: I tensorflow/core/platform/cpu_feature_guard.cc:210] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
To enable the following instructions: AVX2 AVX512F AVX512_VNNI FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.
2024-04-24 18:27:39.578656: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT
###################### Training for line label ###########################
ready to fit training data..
shape of x_train,x_test,y_train,y_test>> (51, 11) (22, 11) (51, 5) (22, 5)
loading embeddings...
/home/lenovo3/resume-parsing/ResumeFilter-master/data/glove.6B/glove.6B.50d.txt
Glove.6B.zip is being downloaded>>>>
it stuck here
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.