Monarch geneset OGS2.0

DPOGS207110
TranscriptDPOGS207110-TA5730 bp
ProteinDPOGS207110-PA1493 aa
Genomic positionDPSCF300001 + 3203503-3232874
RNAseq coverage398x (Rank: top 30%)
Annotation
HeliconiusHMEL0132680.084.66% 
BombyxBGIBMGA012789-TA0.083.51% 
DrosophilaCG1718-PB3e-10332.92% 
EBI UniRef50UniRef50_E2BWV80.050.10%ATP-binding cassette sub-family A member 3 n=9 Tax=Endopterygota RepID=E2BWV8_HARSA
NCBI RefSeqXP_001812136.10.054.40%PREDICTED: similar to abc transporter [Tribolium castaneum]
NCBI nr blastpgi|2700092010.052.73%hypothetical protein TcasGA2_TC015859 [Tribolium castaneum]
NCBI nr blastxgi|2700092010.052.71%hypothetical protein TcasGA2_TC015859 [Tribolium castaneum]
Group
Gene OntologyGO:00055242.8e-22ATP binding
GO:00168872.8e-22ATPase activity
GO:00001664.6e-09nucleotide binding
GO:00171114.6e-09nucleoside-triphosphatase activity
KEGG pathwayspu:5887243e-174 
 K05648 (ABCA5)maps-> ABC transporters
InterPro domain[606-728] IPR0034392.8e-22ABC transporter-like
[591-775] IPR0035934.6e-09ATPase, AAA+ type, core
Orthology groupMCL10483 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207110-TA
ATGAGTGATCCTAATGGGATACTGGGGAGGGATGACGGACGAAATTCACCGGCCAGCACCGCTCAGCTAACAAAATGCGACAAGGAGCCAGATTGTGTGGCGATAGGTAGCAGGCTGAGAGGCACGATGGGTACCCGCCCTCCCGCCGCCTTCTGGCCCCAACTATGGGCGACCGTCGTCAGGAATCTGCTGCTCAAGAAACGGGACACCAGAAAAACTCTTGCCGAAGTCTTGGTTCCTCTATATTCCCTGGGTGTGTTGATCTTCTTGAAAATGTTGGTGCCGAACCCGAATTTCCCGGAGGTACGGAAACCTGGCCGTCTGTTAAGGATTCATCACGATGTCATCTCAGATAACCACTCGGTTGCCGTTGTTGCGGATTGGGAAACTGCTAATGGCACTTTGAGTTTCCTGGATGACATCAACTCTCTGCTCCGCGAGTCCGGACAACACCAAATTCACTGGATCCGCTACAACAATACCACGGAACTCAACGATGCGTACCACAACAACGCTAAACATTTCCCACTCGCCATCATCTTCCACACAGACCCTGGAGCTTACGGAGAACCACTTAGATACACAATACGGACGAATCCCTCTCGTGACGGGGGCACTCCGTCCACTCGGACCCTGAGCACGTCTCCAGCGAAATGTCGCGAGCGTACGAAGTCAGATAAGGACTGGTCCTCGGACTGGTCCCGTGGCGGTCAGCTCATCCCACTATCGGAGATGCATCGCGAGGACACCTGCCCTGTACTTCAGTATTACTACACTGGTTTCCTGGCGCTGCAAACATTGATCGACTACGTCAAGATTAAGATGGAGACTGGGGCAAATTTCCTGCCACCTCGGGTGGATTTCCGTCAGTTCCCTAAGCGGCAGCATACTGGCGACTGGCTCGTCATCTTCCGCGTGATCATGCCCATGTACATGGTGATGACGCTATCACAGTTCATCACATACTTACTGATGTTCGTTGTGGGGGAGAAGGAGAAGAAGATCAGAGAAGGAATGAGGATTATGGGATTAAAGGACAGTGTTTATTGGGGTTCTTGGTTCTTGATCTACGCCGTGTTCGTCACAATACTCTCCATCGTCAGCACCGTTTTGCTGTTCACTCTGAAGGTGTTCCAGCACTCCTCATACATCTTGATCTTCCTTCTGATGCTGTTATTTGGTTTCACCATCATAACTTTTGCGTTCATGCTAACTCCCTTCTTCGATAAAGCCAGGACTGCTGGTATACTGGGCAGTTTCGCGGTGAACCTGATGAGCGGTCTCTACTTCATTCAGGTGTTCGTCTCTAACGCTGACTCGTTAGCCTTCTGGTTCGTCTCCCTCATCAGCTCCAGCTGCTATGCTTTGGCTATGGACAAGGCTTTGGTGCTTGATATGGCGGGTGTGGGAGTGACGTGGGAGAACTTGTGGAGCGGCCCCGGGGTGCCGTTCGGGGGAAGTCTGATCATGATGGCATTGGACACCGTGTTATATGGACTGGCTGCTTACTGGCTGGATGCCGTTATACCAGGCGAGTACGGCATCAAACAGAAGCCGTGGTTCTGTCTGCTGCCCTCGTTCTGGTCGCCGAGAGCACGCGTCGCGCAACTGTTGCAGGACGGAAACGTTACCAATAATAAGGACATCGAGCCCGTGCCCAAGGAACTACAAGATAAAGAAGCGATAAGGATAGTTGGTCTTCAAAAAAGTTTCCGGCACTGCCGAAGACCGGAGGTCAAGGCTATTGACGGTATCGACCTGAGTATATACGAGGGTCAGATCACGGCGGTGCTGGGTCACAACGGAGCTGGGAAGTCGACACTCTTCAACATCCTTACCGGCCTCACTTCTCCTACAGCTGGAACTGCTTATGTCTATGGTCTGGATGTTAGAGATCCTAACGACATGCACGAGATCCGTCAAATGATAGGAGTGTGCCCGCAGCAGGACGTGTTGTTCGACCTTCTCTCCGTCAAGGAACACCTTCAGTTCTTCGCGGCTGTTAAGGGTATCCCCCGTAAGCGTGTTCCGGGTGAAGTTCAGAGGGCGTTATCGGAGGTGGGTCTCTTGGATCAAATGCACGTGTTCTCCAAACATCTTTCGGGGGGTCAGAAGAGAAAACTTAGCATCGCCATAGCTTTTATAGGGGATCCAAAGATCATAATTCTGGATGAGCCGACTGCTGGCGTGGACCCTGTCTCCCGTCGTCAGACCTGGCGCGTGTTACAGCGAGCTCGTCGCGGCCGGGTGCTGCTCCTCACCACTCACTTCATGGACGAGGCGGACATTCTCGGGGATAGGAAGGCTGTTATAAGCAAGGGACGGGTTCGCTGTGCTGGTACGTCACTATTTCTGAAGAACAAATTCGGTATCGGATATCATCTGACGCTGGTATTAGACGGCGCATGTCGTGAGCACCAGATCACCCGCCTGGTCCGCGGTCACGTTCCCCGGGCGGAGAAGGCTCGTCGTCATGGTCGCGAGCTGTCCTACATCCTGCCGCACTATACTGTACACTTGTTCCCCCCACTCTTTCAGGCTATCGAGTTAGAGATACGAGAAAAGACCAATAGACTAGGTATCACAAGTTACGGTGTATCAATGACGACTCTTGAGGAGGTGTTCCTCAGTCTGGAGGGAGAAAACGCTGAAGAGACAGAAGCAGTTGAGGGAGTGTCGTCCGTGAAGCTGGTGAGGGCCCGCGCCCTCTCCAGAAGCTTGTCGCTCCAAAGCAAGACCTTAAGCTATCAGGAATTGAATGACAAAGAGCAGCAGAAGACAACTTCCCTGCCGACACCGGCCGCCTCGCACGCACTTCACTCCACCACGCATGGAGTCGAACACGTTAAGGTTACCCCTGAAGCACCAGTTTCCGTAGATGCCCTCAGCGAAGCCCTAGTGACACGTCCGTCATGTTGGCGCACATTCTGCGCTCTCGTTTACATCCGAACCGTGCGAATGATACGGGACCCTTACAAGCTATACGTCATGATCTTCATGCCTATCATTTCCTGCGCTTTGGGTCTATACATGAAGTCCCGCCAGATTGTATTTTTTCGCATGCAGCCACTAAAATTGGATCCCAACGCGTATTTCAATAAAACACCCATCGCTTTGTTTAGTGAATCTGACAATATGGAACAAATTGATCAATTTAGGGATTCCTTGGAAACTTTCGGAGCTCATCCCATTGACATGTTTGATGGGAACTTCTCTAGTTTGCTAGATATGGAGAATTTTGGAGCATTCAGTTTGAAGGACAGCCTCGTGTCGTTTGGGAACATAATGGCGTACTACAATAGTACATACACACACAGTTTGCCAATTATAATAAACTTGTTGGACAATACTATATACAGGGTACTAATGTCATCCACGAACCAGTTGGACAACTTCCGCCCTATCGAGGTCCTCACACATCCCTTCCAGCAGACCGAGCAGCAGGAGGAGTTCAATCTTGGGAACGTAGTGTGTGCAATTTTCATGGGGATGATATTCGCTCTGGTCCCCGTCACGCTCGCTGTGGACATTGTTTATGATAGAGAGATCAAAGCCAAGAACCAGCTGCGGGTGAACGGCCTATCTATGAGCATGTATTTCCTTACTTACTTCACTATTCTCATCTTCATCATGGTGGTTACTAGCGCTGGTGTACTGGCCCTGGTGATGGTGAACGATATCCCAAGCCTGACGAACGGTTCCGCCATCACAATGCTCTGCTTCCTGCTGTTGCTGTACAGCCCCTCCGCCATCTTGTTCAACACCTGCCTCTCATACATCTTCGACAAGATGGACTCCGCTCAGAGCATCATGCCCAATATAACAACCTGGGTCGGAGTCATACCGTTCATTCTGGTAGCTGTTCTCGATACTTTCAAATGGGGTTCCAACATAGCGTTCTATCTCCACCTAGTGTTCAGTTTTCTAGACGTTATGTACATACCTTACGCCATCATATACTATGTTGATAGGGTATACCTGACGTGTAACCTCCGCGGTCTATGTACGGTGCCGGACCTCAGCAGCTACTTCACGGCGGAGGTGTGGGTCCTCATCGCAGCTATGCTACTACACGTCCCGGTCTGTGGGGCCGCGTTGTTGGCTGCTGATAGACTCAAGTCTGGAGGGCGATTATGGCCGCGCAAGAGCAGTACATCAGAGAAAGCGGACGTGGAGGCCGAGGTGGCCGACGGTCTAGAGGACGAGGACGTGCGACGCGAGCGGCGGCGTGTCGGCGCTCTGCTGCACGCACAACGCACTCAGCAGGACAGCACACCGCCGGCGCTGCTCGTTCACAACCTTCGCAAGGAGTACAAGCTGCGTAACTCCCGGAGTGGCGCCTGTTGTGGTGAGGGGGAGTGCTCGAGGCGCGCGGCCCTCGCCCGCCTGTCACTGGCTGGACCGCCACGGCTTCGGTTGGTGGCGCGCACACGAAAGGTCCAATTTTCGATGACTCTGGCGCACAAATCGGGCTGTATTTGATCGATGGCGTGGCGTATGTTGACTTTGAGGGCATCGGTGGTTTGCGGCTTGTTGGCATAGACCTGCGACTTAAGAAAACCCCACAAGAAAAAGTCCAGCGGGGTCAAATCGCACGATCTCGGTGGCCAGTTCACGTCGCCTCCGCGCGAGATGACCATGTCCAGAAATTGCTCGTGCAGAACTTCCATCGTGATGCTGGGCGGTCAGAACATAGAGGAGGCTCAATCCAGCGCCTTTCAGATGTTAGGCTACTGTCCACAACACGACGCGCTCTGGAAGAACGTCACCATCAGGGAACATATCGAGTGTTACGCCGCCATACGAGGAATCAGCAAAGCTGACACACCTCGTATCGTGGAGGCCTACCTACACGGTCTCCAGATCGCGGAGCACGCTGGTAAGAACGCGGAGGAGTGTTCTGGTGGTACCAGGAGGAAGTTGTCCTTCGCTCTGGCGTTAGTTGGCTCGCCGAGGGTAGCCCTCCTGGACGAACCTTCCACCGGCATGGATCCTCGCAGCAAACGGTTCCTCTGGGACACCATACTGGCGAGCTTCCAGGGCAAAAAAGGTGCTATACTGACAACACATTCCATGGAAGAGGCTGATGCGTTATGTTCTAGAGTTGGGATTATGGTGAAAGGAGGTTTGAGGTGTATCGGTTCCACGCAACATTTGAAAAATTTGTACGGCGCGGGTTACACGTTAGAGATGAAGATAGGACAAAACAACCAGAAATCAACGATGTTAGAAACAGATCTGTCTATGACTCCATCACCTCTGCGGTCTCAAGACAACTCACCGTCCTTGGAAGAGGCTGATGAAGCTGGCTCCGGTCAAGGTTCTGGTGGGGGTGATTGTGGGCGCGAAGGGGATATTGAGGGTGAAGGTGAGGGTGAGGGTGAGGCTGTGGATGTCAGCATTCACACACCGCTGGTCGGGAACACACCGTCTATGAGACTGCATCATCAGCGTACTGAGTCTAGTGGTGGCGCGTGTGCTGAGGCAGCGATCGCGCTGGTGGGCTCACTGTTCCCGGCCGCCACGCTCGAGGAGAGCTTCGCTGAGCGTCTCGTGTTCTCTGTTCCACAGAGCTCCGTCTCCAGCCTCGCCAGCTGCTTCCAGCAGATAGAGGAGGCGAAAGAAAAGCTGAACATAGTGGAGTACAGTTTCAGTCAGACGACCCTGGAGCAGGTTTTCTTGAAATTTGCACAAACAGAGAACGTGGAAACATCAGACCAAGAACATTAG

Protein sequence:

>DPOGS207110-PA
MSDPNGILGRDDGRNSPASTAQLTKCDKEPDCVAIGSRLRGTMGTRPPAAFWPQLWATVVRNLLLKKRDTRKTLAEVLVPLYSLGVLIFLKMLVPNPNFPEVRKPGRLLRIHHDVISDNHSVAVVADWETANGTLSFLDDINSLLRESGQHQIHWIRYNNTTELNDAYHNNAKHFPLAIIFHTDPGAYGEPLRYTIRTNPSRDGGTPSTRTLSTSPAKCRERTKSDKDWSSDWSRGGQLIPLSEMHREDTCPVLQYYYTGFLALQTLIDYVKIKMETGANFLPPRVDFRQFPKRQHTGDWLVIFRVIMPMYMVMTLSQFITYLLMFVVGEKEKKIREGMRIMGLKDSVYWGSWFLIYAVFVTILSIVSTVLLFTLKVFQHSSYILIFLLMLLFGFTIITFAFMLTPFFDKARTAGILGSFAVNLMSGLYFIQVFVSNADSLAFWFVSLISSSCYALAMDKALVLDMAGVGVTWENLWSGPGVPFGGSLIMMALDTVLYGLAAYWLDAVIPGEYGIKQKPWFCLLPSFWSPRARVAQLLQDGNVTNNKDIEPVPKELQDKEAIRIVGLQKSFRHCRRPEVKAIDGIDLSIYEGQITAVLGHNGAGKSTLFNILTGLTSPTAGTAYVYGLDVRDPNDMHEIRQMIGVCPQQDVLFDLLSVKEHLQFFAAVKGIPRKRVPGEVQRALSEVGLLDQMHVFSKHLSGGQKRKLSIAIAFIGDPKIIILDEPTAGVDPVSRRQTWRVLQRARRGRVLLLTTHFMDEADILGDRKAVISKGRVRCAGTSLFLKNKFGIGYHLTLVLDGACREHQITRLVRGHVPRAEKARRHGRELSYILPHYTVHLFPPLFQAIELEIREKTNRLGITSYGVSMTTLEEVFLSLEGENAEETEAVEGVSSVKLVRARALSRSLSLQSKTLSYQELNDKEQQKTTSLPTPAASHALHSTTHGVEHVKVTPEAPVSVDALSEALVTRPSCWRTFCALVYIRTVRMIRDPYKLYVMIFMPIISCALGLYMKSRQIVFFRMQPLKLDPNAYFNKTPIALFSESDNMEQIDQFRDSLETFGAHPIDMFDGNFSSLLDMENFGAFSLKDSLVSFGNIMAYYNSTYTHSLPIIINLLDNTIYRVLMSSTNQLDNFRPIEVLTHPFQQTEQQEEFNLGNVVCAIFMGMIFALVPVTLAVDIVYDREIKAKNQLRVNGLSMSMYFLTYFTILIFIMVVTSAGVLALVMVNDIPSLTNGSAITMLCFLLLLYSPSAILFNTCLSYIFDKMDSAQSIMPNITTWVGVIPFILVAVLDTFKWGSNIAFYLHLVFSFLDVMYIPYAIIYYVDRVYLTCNLRGLCTVPDLSSYFTAEVWVLIAAMLLHVPVCGAALLAADRLKSGGRLWPRKSSTSEKADVEAEVADGLEDEDVRRERRRVGALLHAQRTQQDSTPPALLVHNLRKEYKLRNSRSGACCGEGECSRRAALARLSLAGPPRLRLVARTRKVQFSMTLAHKSGCI-