Monarch geneset OGS2.0

DPOGS200378
TranscriptDPOGS200378-TA1887 bp
ProteinDPOGS200378-PA628 aa
Genomic positionDPSCF300026 + 1073004-1084830
RNAseq coverage242x (Rank: top 43%)
Annotation
HeliconiusHMEL0053820.085.99% 
BombyxBGIBMGA007221-TA0.072.40% 
DrosophilaCG1718-PB4e-13542.38% 
EBI UniRef50UniRef50_D7EHR83e-13943.48%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7EHR8_TRICA
NCBI RefSeqXP_970754.23e-14043.48%PREDICTED: similar to AGAP012156-PA [Tribolium castaneum]
NCBI nr blastpgi|1892418545e-13943.48%PREDICTED: similar to AGAP012156-PA [Tribolium castaneum]
NCBI nr blastxgi|2700156821e-13543.31%hypothetical protein TcasGA2_TC002276 [Tribolium castaneum]
Group
Gene OntologyGO:00055242.2e-07ATP binding
GO:00168872.2e-07ATPase activity
KEGG pathwaygga:4163865e-115 
 K05643 (ABCA3)maps-> ABC transporters
InterPro domain[519-609] IPR0034392.2e-07ABC transporter-like
Orthology groupMCL10087 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200378-TA
ATGAGTGGAAGACAATTAGGAAATCTCACGACAGCAAAGAGTGATGGTACATTAGCTTTCTCTCCTGAAAATGCTTTGACCAGGAAAGTTACCAAAGATGCCATGGCAAAGGTGGCCTTGGATAATCTCAATGGATTCATAGCTCTTCTATTTGATCCAAGAGTTTTACCTGAACCTAAAGGATTCAATGATTCGTCTGAACTAGAGGCAGCCCTCTCTAAGCCTAATGTCATGAATCACATCCTTGTTGGCATACAGTTTGATGATAGTATGGCCAATGCTACTGAATGGCCAGAAGATATTAATGTGACCTTGAGATTCCCAGCTGTTATGAGGACTCCAATGCTAGAACATCCTTTGAGGATCAGCTGGAGAACGAATCTCTTGTTCCCTTTATTCCCGCAGCCTGGGCCTAGAGTTCCCAAGGACATGTATGGTGGAAAAACACCAGGGTATTCCCCCGAGATGTTCCTGGCGGTGCAGCACGCTGTGTCGCAGGAGATAATAAAACAGAAGACTGGCAAGTCCATCAACACCAAGGTGTACTTACAACGTCTACCTCAACTGTCTTATAGACAGGACGATTTATTAGTTGCTATGGAACGATTTATATCTATGATCATTATGATGTGCTTCGCTTACACGTTCGTTAACACAGTCAGGGTTGTCACCGCTGAAAAGGAAATGCAATTGAAGGAGACAATGACTATAATGGGACTGCCGTCATGGCTGCATTGGTTGGCGTGGTTCATCAAACAATTCTCTTTCCTATTAATATCTGTTATCCTCATGGTCATATTGTTTAAGATTCCCTTCAACTCTACGTCAGACGGCGAAGGGTACGCCGTCCTAACTTTCACACCATGGAGTGTACTATTCTTCTTCTTAATACTATTCGTGATCGCCTCGTTGTCTTTCTGTTTTATGGTCAGCGTGTTCTTCACAAGAGCCAACACAGCGGCGTCGTTCATGGGCCTGGCTTGGTTCTCGACATATTCAGCTTATATGTTGACTCAAATGTTGTATGAAGATATAAGTTTAACGACAAAGCTGTTGCTAAGTTTAATATCGAACACAGCTATTGGTTACGCACTCCAGATGTTGGTAGTCTGTGAGGGAACTTCCAGAGGTTTACAATGGGACGAGTTCTTTATGCCGGTATCCTATCACGATCAGTTCCAGCCGGGTCACGTAGCGCTCATGTTGGTCCTAGATTCGATTCTGTACATGTTAATAGCCGTCTACGTTGAGAAAATTCGGCCGGGTCTATACGGAGTGCCGTTGCCATGGTACTTTCCGTTCACGAAAAGTTTTTGGTGTCCCGATAACACTAAGGTTGCTGCACTAACGAATAAAGACGGCGTCGATCAGGAATACAAGAACGCTTTGTTGAAAGTCATACACGATGAGGAACCAAAGGGAATACCGATGGGAATTAATATAGAGAATCTTACAAAAGTATACAAAGGAAGGAAAAAGGCTGTCGATAATTTGAATCTTAGGATGTATGAAAATGAAATAACAGTTCTGCTTGGTCACAACGGTGCCGGGAAAACAACAACGATATCAATGCTGACGGGTATGGTCCCACCATCATCCGGTTCGGCCACTATAAACGGTTACGACATTACCCGTGAGACGGAACAGGCTCGTAGGTCCATCGGCATATGTCCACAACACAACGTCTTATTCCCGGACCTGACTGTAGCTGAACATATAATATTCTACTCAAGATTGAAAGGAGTCCCCTCCTCGAAGTTACAGGCTGAGGTTGATCATTTCGTCAAACTGTTGGAATTGGAGGAAAAGGAAAGTGACAAAAAGAAAGTTACCACCGCTCTGACAGCAATTAACGTATTTCAGTTATATGTCAAATTTGACAGTTGA

Protein sequence:

>DPOGS200378-PA
MSGRQLGNLTTAKSDGTLAFSPENALTRKVTKDAMAKVALDNLNGFIALLFDPRVLPEPKGFNDSSELEAALSKPNVMNHILVGIQFDDSMANATEWPEDINVTLRFPAVMRTPMLEHPLRISWRTNLLFPLFPQPGPRVPKDMYGGKTPGYSPEMFLAVQHAVSQEIIKQKTGKSINTKVYLQRLPQLSYRQDDLLVAMERFISMIIMMCFAYTFVNTVRVVTAEKEMQLKETMTIMGLPSWLHWLAWFIKQFSFLLISVILMVILFKIPFNSTSDGEGYAVLTFTPWSVLFFFLILFVIASLSFCFMVSVFFTRANTAASFMGLAWFSTYSAYMLTQMLYEDISLTTKLLLSLISNTAIGYALQMLVVCEGTSRGLQWDEFFMPVSYHDQFQPGHVALMLVLDSILYMLIAVYVEKIRPGLYGVPLPWYFPFTKSFWCPDNTKVAALTNKDGVDQEYKNALLKVIHDEEPKGIPMGINIENLTKVYKGRKKAVDNLNLRMYENEITVLLGHNGAGKTTTISMLTGMVPPSSGSATINGYDITRETEQARRSIGICPQHNVLFPDLTVAEHIIFYSRLKGVPSSKLQAEVDHFVKLLELEEKESDKKKVTTALTAINVFQLYVKFDS-