Objective:
As part of parsing resume uploaded by end users in HRMS application, KPAI team is going to develop NLP rule based application in GATE tool. For this work we need to have some manually annotated resumes (at least 3000 documents) to develop expected performance application.
Annotation Types
As per current requirements from HRMS team, we are expecting following annotation types should be done by manual annotators.
- Personal Information
- Education Details
- Work Experience
- Skill Set
- Certifications
- Others
| Annotation Type |
Description |
Features |
| Name |
Name of the resume holder |
initials, first name, middle name, last name |
| DOB |
Date of birth of the resume holder |
NA |
| Gender |
Gender of the resume holder |
NA |
| EmailId |
Email Id of the resume holder |
NA |
| MobileNumber |
Mobile Number of the resume holder |
NA |
| Address |
Address of the resume holder |
addressline1, addressline2, street, city, state, country, postal code |
2. Education Details Section
| Annotation Type |
Description |
Features |
| Course |
Course name ex: IT, CSE etc.. |
NA |
| CourseLevel |
Graduation Type ex: B.Tech, Masters etc |
NA |
| EducationMode |
Course completion mode ex: Regular, Distance etc |
NA |
| StartDate |
Course Start Date |
NA |
| EndDate |
Course End Date |
NA |
| InstituteName |
Name of the Institute |
NA |
| University |
Name of the University |
NA |
| Percentage (GPA) |
Percentage/GPA obtained in the course |
NA |
| Description |
Text area under this heading from resume |
NA |
3. Work Experience Section
| Annotation Type |
Description |
Features |
| Employer |
Organization name |
NA |
| Designation |
Designation of the employee |
NA |
| StartDate |
Work experience start date |
NA |
| EndDate |
Work experience end date |
NA |
| Description |
Text area under this heading from resume |
NA |
4. Skill Set Section
| Annotation Type |
Description |
Features |
| SkillName |
Name of the employee skills |
NA |
| Description |
Text area under this heading from resume |
NA |
5. Certifications Section
| Annotation Type |
Description |
Features |
| CertificationName |
Names of the employee certifications |
NA |
6. Other Annotation Types
| Annotation Type |
Description |
Features |
| Heading |
Names of different resume headings Ex: Work Experience, Technical Skills etc |
NA |
Note: At present those are planned at my end, please review and suggest if any other section required from resume
Work Plan
| Task |
Assigned To |
Timeline |
| Need to collect resumes from different regions as well as from different domains |
? |
? |
| Convert different format of resumes (DOCX,PDF etc) into text file format |
Suresh (HRMS Team) |
? |
| Load them in GATE interface and do manual annotations |
Murali (HR) |
? |
| Build rule based engine by observing patterns from manually annotated resumes |
KPAITeam |
? |
Output
Manual annotated resumes for GATE application processing
NOTE: Please review and suggest any changes required in this document. In Work plan section we need to fill assigned person and time lines for the tasks.
And we will provide another documentation for how to do manual annotations in GATE interface.
Thank You
Krishna Reddy