Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dimensionality Reduction #17

Open
sanalmert opened this issue Oct 29, 2024 · 0 comments
Open

Dimensionality Reduction #17

sanalmert opened this issue Oct 29, 2024 · 0 comments
Labels
component Related to a RTDIP component

Comments

@sanalmert
Copy link

sanalmert commented Oct 29, 2024

User Story

  1. As a RTDIP user
  2. I want to reduce the dimensionality of incoming data
  3. So that i can use it for further ML processing and so overal simplify my data

Additional context

  • Reducing the redundant data
  • How to allocate strong correlating sources

Acceptance Criteria

  • A component exists that
    • will combine two (or more) columns into one if they are exact duplicates or highly correlate
    • it receives an optional threshold parameter [0-1]
    • will analyze if two columns correlation is higher than the threshold, if so combine them into one by a given method (sum, mean, delete one,..)

Definition of Done

  • Test cases have been created and are running successfully
  • Documentation for the new component was added
  • Github Actions are running without errors
@sanalmert sanalmert converted this from a draft issue Oct 29, 2024
@luccalb luccalb added the component Related to a RTDIP component label Nov 5, 2024
@luccalb luccalb changed the title [Component] Dimensionality Reduction Dimensionality Reduction Nov 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component Related to a RTDIP component
Projects
Status: Product Backlog
Development

No branches or pull requests

2 participants