Monarch geneset OGS2.0

DPOGS201545
TranscriptDPOGS201545-TA4938 bp
ProteinDPOGS201545-PA1645 aa
Genomic positionDPSCF300006 + 1793305-1804562
RNAseq coverage17x (Rank: top 80%)
Annotation
HeliconiusHMEL0090444e-8061.45% 
BombyxBGIBMGA002728-TA2e-7357.41% 
DrosophilaCG1718-PB6e-1622.62% 
EBI UniRef50UniRef50_Q2WEC57e-2222.54%ATP-binding cassette transporter sub-family A n=1 Tax=Pecten maximus RepID=Q2WEC5_PECMA
NCBI RefSeqXP_001947916.15e-3422.41%PREDICTED: similar to AGAP006379-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|2608107687e-2525.64%hypothetical protein BRAFLDRAFT_118238 [Branchiostoma floridae]
NCBI nr blastxgi|1954351592e-2519.06%GK15524 [Drosophila willistoni]
Group
KEGG pathwaydre:5643449e-22 
 K05643 (ABCA3)maps-> ABC transporters
Orthology groupMCL20682 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201545-TA
ATGACCGCCGAAGAAGTAACTAAACCAATACGAAGCAAATTTCAACGGCACACAAATGCTCTTATCTGGAAATCGTATTTACAGCGGCAGCGTAGATGGGGTCTTCTTTGTGTGGAAACTCTCTTGGCTGTTGTTTTATATCTATCTGCTATTTTAATAGCGAAGCCGGTGTTTCTTACTCCTTTGCAAGCTATCCCACAGCCGCCTCTTACTGCGGCTGATATCTTGGCTTCCTTGAACAAGAAAAATATCCTGGGTTACGCTCCTAATGTTCCCCCTTTTAATGATATCATGAACACCGTTGGGGACAGTTTGGGAACTGAAATTATCAGAGCGCCTAAAGAGGATGACCTTAATAATATCCTGTACAATAGATCTAGAGGGGTCCCCTTAAATAACCCAGTTATTTGGATTCTTTGGAAGAAAACGGAGAACAACATGTGGAAATTCAGCATCCGCAGTACGGAGCGGGCTCAGTACGTGACAGGGAAGAAAATATCATCTAATCCACATTTGCGAAGTGGGATACTCGCTGTGCAGTTGGCAATCAGTCAGGCCATATTACAGTATGCTAGCGGAACAACGCCTGCATATCAGCTAAGCCTGGCATCAATGCCTGTGTCACCATTGATGGAACATGAACGAGTGCGTAAGCACATATCATCAATCTTGTTATGTTTCACGCTAGCACTGTTACCGCCGGTGTTAGAGACTCAGTCACTGGTTATACATGAAACGAGTCATAAATTTAAACGAGCCCTGCGTATTCGCGGTGTGGATTACTCTTCCATGTATATTGGCTGGTTAACTTACTTCTATCTCACGGCCCTACCCATCTGCCTACTCGCAAGCATCACGCTAATCCTTATCTTCCGTTGGGTCCATCTTCTGTTTACCTTTATACTTGTACTCGCATATGTCTCCGTGATGATTATGCTAGCTTTGATAATGGCCATGTTTCATACTCAAACAAAAGTTGCCGCAACATGGACGACATTATTCACTCTACTGCAGACGTTTCTTGCGGAGTTGATCGTACACCACGGCGTTGACCTTAAACATGAGATCCTGACCTTTTTCCTACATCTTATCATACCCCCTTTGGGATTAATGCACGGATTCAATGAATTCGCTTTATTGCAAACTGGACATGAAGGGCGCTGGGAGGGCAATTCGCTAGTATACACAATTTTGTTTTGGTTTATTATGATAGCATTGTATTTCGGTATTTTAATGATGTTGCAAAGAACTCTCAAAGCAGGGGCCATTGGAGGACAGGTTTCATGGAAATCTATCATTTTTAAATCTGTCGAGGATGTTTCTAAATTGCATCAAATTAAAAATCCTACGGGACGAGAGCACGAAAAATTACAAGAAGTCAATGAGTTGGCCGCTAAAGCCATAAGTATTCGCAATATGTCAAAGAGTATTATGGGAAATTCTATGCTGAGTAACATAACATTAGATATATACCGAGGTGAATTCACCATTCTTTATTCTGAATTCATTCAAAGTAAAATGATTCAGACCCTTGAAGACCTGTTAATGGGGCTTACTCATCCTGACGAGGGCTCTATAAACATATTAGGTGAAGAACTAACTCATGGAAAAACGTTTGTTAGTGTGCCTCATATGCTGGGTTACTGCCACCGCAAGGGTTGTCTGGTAGACGATCTCACTGTACAAGAACATTTCACGCTTTATTGTTCCTGCCGTGAATCAAGAATCGACCATTGTAACATGAAAAAGAATAAAGGTACTCACGATTTGAAAAAATATAAATATAAATTATTTATGGTGTCTGCAGCATTAGCTAAGATTTGTCTTTGGAACGAGTCTACGGATTACGTGTCCGAGTACGAGCATCTGAGGTCGATACGTTTGTTGGTAGAGTGTGAACTGAAGACAGTGTGCCATGAATACGTTCGCAACCTGGGCGTTTTTTACCAAGCTCAGCTTTGTTGGGCCATCGCTATGCTGTTAGAGCCGAGGATCATCATCATACCATCATTCGGTTATGAATCTGCATTTAATTCACAATACAAGAAATATATAACAGTCATCAAATTATGTTTTACGAGTATATTACTCGAATATGCCGACAGAGTATTTATCTTCGACAACAAGTTGCTAGTGTTTGGTGGATCACCCGCTTATATGTTTTTTAAATATGGTCGAGAGTACAGAGTTCGACTAACCCTCAAAAGTGGGAGCTTTTCAAGCGATGAAGACGTTAACGAGCTTCTCAAAAGGACCACAGAGGCTGGAGCGACAATAAGAGCTCACTTAGGCACTTTGCTCATACTCCGTTTGCCCGCTATTCCCACGGCTACCGTCGCGGCTCTGGTCAAAGACCTTAGCGAAAACTCAGATAAATATGAAATAGTGTCTATGAATATAAGTATACCGGATTCAGAAGAAGTTTGTAATCGAGCTATTTTTGAATCGAGGGCTAACATACATGGCAGCGAAGAACATCACCAAAGAATTAAAATCGCGTTAAACCATCTCGCTGAAGCGGCACCCATAAAGAGGCAGAAAAGTTACATAGGAAACTTAACTCATTTGAAATACACTTGGGCCAAATTTGTGGCTTACTATCGACACTATCGTCTATACTTTTTTGTAACTATTTTGTTGGCAATTACTAGCGGGATTTTCATTGGCCTTTCACTTGCAACTGTTCTGGGAGAAATAGAGAGAGATCGGGCAACGAAGGCTATTTTGCATGGAGAGGTATTAACGGTGGAAGCGCTGGAACAGAAGACAAATTTGATTCTACGTGCAGATAGCTCAAGTGAATCGAAATCAGTAGTAAATGCTTACGTCTTTTCTGAAACAATGGCAACGGAAAAACAAATAGAGAATATGTTTTATACCGCTCTTTCTGCACCCGAGAGCATTACGGAGTACTTGGTAACGCGAGCCATTGATTCCCCTCAACAATATGTATTTTTATACGCTTACGGCTTGGATGTCGCTGAAGTGAATGGATCTCTCAGAGTGCGGGTTCTGTACAGTCCTCTGCACACTGACCACAGCGCTGCCCCTCGTGCTCTCGCCAGAGCTTTCATGGCACTTCTTAGACACTATACCTCGGCTTTGGACGCCACCATAGAAGTTACCGACGACCCGCTAGCACTAGATCTGACGACTTATATGAAACAGGTTGCGGTTCCTCCAATTTTAATACAGTTTCTACTTATTCTAACCATAACCCACATTACTCTGATACCATCCAAAGAACATGGATTCATTAGGCATATGCAGAGTCATGCGAAGGATTTCTCACCCGCTCGGTATTGGTTGACGTTGTTTATATGCGACTTAGTTCTTTGCTTGTTCTTGGTGATTTTAATGACGGTTGCAATGATTGTTGTCATGGTATTCGTGGCGCCCATGACGTTCCGATACAGTGATTTAGATTTAAAACATGAGACGTGTTTTAACCCGCTTATACTTCTAGCGGTCGTGCCTACAATGCTGATAATTTATTGTGTAGGCTGTATACCGCAAGCATATTTGTTCAGCCTCGGACCTAACGCAGCTCTTAACGCTATGACTTTCGTAATGATAAACCTTCTTTTCAGTGAGACGACAGTTTTTGCAAAACTACTGTATGGGAACGCTCTTAATTACGCTTTGAATTTTATATCAGTTTCACCACAATTCAACATTGCTATCGCATTCGTGAAAATTGAGAAATGGATATATCCGAATAAAGAGCCGCGGGAAACTGAAACAATGGGAGTGCGGCAGGAGAGGAATAATGTAGTTAATATGAAAAAGCAATTGAAAAAGAAAATTCCCAAAAATAAGAAACAAGTTGAAACATATTTACTTGCACATAATATATCCCAACGACACTACGAGAAATCTATAATCCGAAATGTTTATCTCAGTTTGAGCAAAGGTGAGGTGTTGGCTATTTCGGGTGTCTTGGAACATGGAAGACTGCGGCTGTGTGAAATATTGGCTGGGTTCCAATTACCGACAGACGGACAGTTATGGTGCTTATCAAAATGGACTCTTCTTCGAAATCCTCATTTATTCAGTCAGCAGACGTCATTGAGCTGCGAACATAATCCTCTTCCGGGCTGGATGAGTGTGAGGAACGGTCTCGAGATGCTCGCCGTACTGCGCGGCGTCCCACGAAGCCAGGCTTTGGATCTGGTCATGGATTACGTCAACGCTATGGAACTTTACCACCAAGCCGACAGACAAATTTGCTACTTAAAAGCTAAAGACGTGGCGCGTTTGCACTTCGCGGCGGCCGTAATCGGTGCCCCTGAAGTTGTGATACTGGAGGAGAGTACAGAGTACCAAAAGTACTCCGTAAGACGAGCGATGTACCATATATTAAATAACCTGCGACAACGGGGGCATTCTATTATCATATCCTCGAGCAGTGTGGAGACTCTTTTACCCGTGTCAAATCGTTTGGCAATTTTAGTGAATGGTCGAATCTATGATATAGATCAGTTGGACAAACTCGTGGAACGACATAGCGAGAAGGGTTTTACTGTCGTGGTGCAACTGAAGTACAAAACCAATGTTACAAAAATATTCGTTAGATATTTTACAAAGTTTATCATCAATGACTCTTCGGAGAAACTAGTCAACTTCCAACTACTAGATGAAGATCTAACTTACGCGTCAGTATTTGAAAAGATGGAAATTCTCCAAGAGGAAAACGATGCAGTGAATTCTTACATAGTTTCTGCTACACCAATAGATTACATCTATAACTCTATAATAAGTCAAGAAGGCGGTCACATTGCTGAAGATCAAGAATCCCGTCTCAGCAAATACCTTTTCCGTCACAGGAAGCAGATAACTCCACCTAAAACTGTTTTAAACAAATTGATACCTTTCGAAACCCGATTTCATCTAACCAAGCTTAAGGAGTTGCCTTGGTCTGTAATATTTGACAAATAG

Protein sequence:

>DPOGS201545-PA
MTAEEVTKPIRSKFQRHTNALIWKSYLQRQRRWGLLCVETLLAVVLYLSAILIAKPVFLTPLQAIPQPPLTAADILASLNKKNILGYAPNVPPFNDIMNTVGDSLGTEIIRAPKEDDLNNILYNRSRGVPLNNPVIWILWKKTENNMWKFSIRSTERAQYVTGKKISSNPHLRSGILAVQLAISQAILQYASGTTPAYQLSLASMPVSPLMEHERVRKHISSILLCFTLALLPPVLETQSLVIHETSHKFKRALRIRGVDYSSMYIGWLTYFYLTALPICLLASITLILIFRWVHLLFTFILVLAYVSVMIMLALIMAMFHTQTKVAATWTTLFTLLQTFLAELIVHHGVDLKHEILTFFLHLIIPPLGLMHGFNEFALLQTGHEGRWEGNSLVYTILFWFIMIALYFGILMMLQRTLKAGAIGGQVSWKSIIFKSVEDVSKLHQIKNPTGREHEKLQEVNELAAKAISIRNMSKSIMGNSMLSNITLDIYRGEFTILYSEFIQSKMIQTLEDLLMGLTHPDEGSINILGEELTHGKTFVSVPHMLGYCHRKGCLVDDLTVQEHFTLYCSCRESRIDHCNMKKNKGTHDLKKYKYKLFMVSAALAKICLWNESTDYVSEYEHLRSIRLLVECELKTVCHEYVRNLGVFYQAQLCWAIAMLLEPRIIIIPSFGYESAFNSQYKKYITVIKLCFTSILLEYADRVFIFDNKLLVFGGSPAYMFFKYGREYRVRLTLKSGSFSSDEDVNELLKRTTEAGATIRAHLGTLLILRLPAIPTATVAALVKDLSENSDKYEIVSMNISIPDSEEVCNRAIFESRANIHGSEEHHQRIKIALNHLAEAAPIKRQKSYIGNLTHLKYTWAKFVAYYRHYRLYFFVTILLAITSGIFIGLSLATVLGEIERDRATKAILHGEVLTVEALEQKTNLILRADSSSESKSVVNAYVFSETMATEKQIENMFYTALSAPESITEYLVTRAIDSPQQYVFLYAYGLDVAEVNGSLRVRVLYSPLHTDHSAAPRALARAFMALLRHYTSALDATIEVTDDPLALDLTTYMKQVAVPPILIQFLLILTITHITLIPSKEHGFIRHMQSHAKDFSPARYWLTLFICDLVLCLFLVILMTVAMIVVMVFVAPMTFRYSDLDLKHETCFNPLILLAVVPTMLIIYCVGCIPQAYLFSLGPNAALNAMTFVMINLLFSETTVFAKLLYGNALNYALNFISVSPQFNIAIAFVKIEKWIYPNKEPRETETMGVRQERNNVVNMKKQLKKKIPKNKKQVETYLLAHNISQRHYEKSIIRNVYLSLSKGEVLAISGVLEHGRLRLCEILAGFQLPTDGQLWCLSKWTLLRNPHLFSQQTSLSCEHNPLPGWMSVRNGLEMLAVLRGVPRSQALDLVMDYVNAMELYHQADRQICYLKAKDVARLHFAAAVIGAPEVVILEESTEYQKYSVRRAMYHILNNLRQRGHSIIISSSSVETLLPVSNRLAILVNGRIYDIDQLDKLVERHSEKGFTVVVQLKYKTNVTKIFVRYFTKFIINDSSEKLVNFQLLDEDLTYASVFEKMEILQEENDAVNSYIVSATPIDYIYNSIISQEGGHIAEDQESRLSKYLFRHRKQITPPKTVLNKLIPFETRFHLTKLKELPWSVIFDK-