Skip to content

GitLab

  • Menu
Projects Groups Snippets
    • Loading...
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
  • Sign in / Register
  • N normcontrol-Document-Parser
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 12
    • Issues 12
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 1
    • Merge requests 1
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Monitor
    • Monitor
    • Incidents
  • Packages & Registries
    • Packages & Registries
    • Package Registry
    • Container Registry
    • Infrastructure Registry
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • Вячеслав Марцинкевич
  • normcontrol-Document-Parser
  • Merge requests
  • !14

Merged
Created Feb 22, 2023 by Вячеслав Марцинкевич@slavamarcinOwner

[Feat] Completely changed the algorithm for highlighting paragraphs of pdf documents, removing duplicates of tabular paragraphs, added documentations

  • Overview 14
  • Commits 9
  • Changes 17

Created by: slavamarcin

  1. The algorithm for highlighting paragraphs has been completely changed by reducing the amount of code, removing unnecessary conditions and loops. Thus, the accuracy of the selection and the speed of work have increased.
  2. Added a function to extract tables and save them in a document context variable
  3. Added a method to remove duplicate paragraphs tables
  4. Fixed variable names
  5. Added documentations for files
Assignee
Assign to
Reviewer
Request review from
Time tracking
Source branch: v.marcin