Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat!: update ExonGenomicCoordsMapper + UtaDatabase #352

Merged
merged 45 commits into from
Aug 21, 2024

Conversation

korikuzma
Copy link
Member

@korikuzma korikuzma commented Aug 9, 2024

So sorry for this large PR. 😓

Close #345 and #332

  • Update and fix bugs inExonGenomicCoordsMapper
  • pin ga4gh.vrs to 2.0.0a10
  • Remove get_genes_and_alt_acs from UtaDatabase. Moved this to ExonGenomicCoordsMapper and renamed to _get_genomic_ac_gene. Will return single gene since genomic accessions are not needed anymore.

Addresses #224 . I still want to re-order private methods and see if there are any places we can clean up. Since we need to get a release out ASAP suggested changes for refactoring will be done in a separate PR.

* Only focused on `UtaDatabase` for now
* `alt_pos_change` was renamed to `pos_change`
Addresses part of #224
* `transcript_to_genomic_coordinates` renamed to `tx_segment_to_genomic`
* `genomic_to_transcript_exon_coordinates` renamed to `genomic_to_tx_segment`
Addresses #224

* Move `get_tx_exons_genomic_coords` from `UtaDatabase` to `ExonGenomicCoordsMapper` as a private method (`_get_tx_exons_genomic_coords`)
Addresses #224

* `genomic_to_tx_segment` will now require inter-residue coordinates to be passed
addresses #224

* initial work for cleaning up exon coord data retrieval
addresses #224

* Use `ExonGenomicCoordsMapper._get_all_exon_coords` instead
* include `ord` in `get_alt_ac_start_or_end` response. This represents the exon number (0-based)
* include include `ord` in `UtaDatabase.get_tx_exon_aln_v_data` response. This represents the exon number (0-based)
@korikuzma korikuzma added the enhancement New feature or request label Aug 19, 2024
@korikuzma korikuzma changed the title feat!: update ExonGenomicCoordsMapper feat!: update ExonGenomicCoordsMapper + UtaDatabase Aug 19, 2024
Co-authored-by: James Stevenson <james.stevenson@nationwidechildrens.org>
@jsstevenson jsstevenson self-requested a review August 20, 2024 19:02
Copy link
Member

@jsstevenson jsstevenson left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍 looks good, lots of work!

@korikuzma
Copy link
Member Author

@jsstevenson I moved + modified a UtaDatabase method in 615d747 so will re-request your review

@korikuzma korikuzma merged commit 587136d into staging Aug 21, 2024
12 checks passed
@korikuzma korikuzma deleted the issue-224-new-structure branch August 21, 2024 14:18
korikuzma added a commit that referenced this pull request Aug 21, 2024
Close #345 and #332

* Update and fix bugs in`ExonGenomicCoordsMapper` 
  * Change output for public methods in `ExonGenomicCoordsMapper` (leverage VRS Sequence Location and improve structure for transcript segment data). Renamed `warnings` to `errors`.
  * Resolve offset / genomic location bugs (#345 and #332)
  * Remove `mane_transcript` instance variable and use `mane_transcript_mappings` instead.
  * Refactor code that was unnecessary or extra.
  * Rename arguments in `genomic_to_tx_segment`: `alt_ac` -> `genomic_ac`, `genomic_start` -> ` seg_start_genomic`, `genomic_end` -> `seg_end_genomic`
* pin `ga4gh.vrs` to `2.0.0a10`
* Remove `get_genes_and_alt_acs` from `UtaDatabase`. Moved this to `ExonGenomicCoordsMapper` and renamed to `_get_genomic_ac_gene`. Will return single gene since genomic accessions are not needed anymore.

---------

Co-authored-by: Jeremy Arbesfeld <jarbesfeld@gmail.com>
korikuzma added a commit that referenced this pull request Aug 21, 2024
Close #345 and #332

* Update and fix bugs in`ExonGenomicCoordsMapper` 
  * Change output for public methods in `ExonGenomicCoordsMapper` (leverage VRS Sequence Location and improve structure for transcript segment data). Renamed `warnings` to `errors`.
  * Resolve offset / genomic location bugs (#345 and #332)
  * Remove `mane_transcript` instance variable and use `mane_transcript_mappings` instead.
  * Refactor code that was unnecessary or extra.
  * Rename arguments in `genomic_to_tx_segment`: `alt_ac` -> `genomic_ac`, `genomic_start` -> ` seg_start_genomic`, `genomic_end` -> `seg_end_genomic`
* pin `ga4gh.vrs` to `2.0.0a10`
* Remove `get_genes_and_alt_acs` from `UtaDatabase`. Moved this to `ExonGenomicCoordsMapper` and renamed to `_get_genomic_ac_gene`. Will return single gene since genomic accessions are not needed anymore.

---------

Co-authored-by: Jeremy Arbesfeld <jarbesfeld@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request priority:high High priority
Projects
None yet
3 participants