Skip to content

GitLab

  • Menu
Projects Groups Snippets
    • Loading...
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
  • Sign in / Register
  • F FEDOT
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 87
    • Issues 87
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 1
    • Merge requests 1
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Monitor
    • Monitor
    • Incidents
  • Packages & Registries
    • Packages & Registries
    • Package Registry
    • Container Registry
    • Infrastructure Registry
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • ITMO-NSS-team
  • FEDOT
  • Merge requests
  • !182

Closed
Created Nov 23, 2020 by Rosneft rosneft@rosneft_userDeveloper
  • Report abuse
Report abuse

Clustering example implemented

  • Overview 4
  • Commits 5
  • Changes 19

The script examples/clustering.py now contains the example of chain for clustering #175 (closed) .

Added:

  • tuning for clustering
  • ensemble clustering
  • more clustering models

Known issues:

  • Silhouette metric is used for the composition, but it is not appropriate here (the value converges to -1 very quickly).
  • DBScan model is added, but it should be more models to run the experiments
  • The number of clusters for K-means is pre-defined now (should be identified from data)
  • The secondary-only ensembling model shold be added for clustering to mix several predictions

Example of the composite clustering model:

image

Current metrics:

adjusted_rand_score for basic model 0.350061 with iris adjusted_rand_score for composite model 0.568116 with iris adjusted_rand_score for basic model 0.508453 with simple data adjusted_rand_score for tuned model 0.570815 with simple data

Assignee
Assign to
Reviewer
Request review from
Time tracking
Source branch: clustering