INFORMATION TECHNOLOGIES FOR AUTOMATING THE PROCESSING OF CANDIDATE CV TO INCREASE THE EFFICIENCY OF IT TEAM FORMATION
Keywords:
information technology, Python, CV, recruitment automation, spaCy, NLP, HR systemAbstract
The article is devoted to the development of an information technology for automated processing of candidate CV in PDF format using the Python programming language. The approach to extracting, structuring, and further analyzing data with the use of the pdfplumber, spaCy, and pandas libraries is presented. The proposed module enables the identification of key resume elements, including education, skills, contact information, and work experience, followed by the formation of structured data in JSON format. Special attention is given to ensuring the universality of the algorithm for CV with arbitrary structure and Ukrainian-language content. The paper describes the main stages of implementing the software solution, including data flow diagrams, PDF processing schemes, and examples of unit testing of system functions. The developed technology can be used to automate the initial stage of recruitment and integrate with HR analytics systems, thereby improving the accuracy and speed of candidate data processing in the IT team formation process.
References
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Information technologies in economics and environmental sciences

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.