Monarch geneset OGS2.0

DPOGS213147
Transcript: DPOGS213147-TA, 6810 bp
Protein: DPOGS213147-PA, 2269 aa
Genomic position: DPSCF300016 + 1088656-1104291
RNAseq coverage: 892x (Rank: top 14%)
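
The lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration and assumes two things the record does not state: that the transcript is coding sequence only (start codon through stop codon), and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 6810
PROTEIN_AA = 2269
GENOMIC_START, GENOMIC_END = 1088656, 1104291


def coding_length_matches(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript is exactly one codon per residue plus a stop codon.

    Assumption: the transcript contains no untranslated regions.
    """
    return transcript_bp == 3 * (protein_aa + 1)


def genomic_span(start: int, end: int) -> int:
    """Return the length of the genomic interval.

    Assumption: coordinates are 1-based and inclusive.
    """
    return end - start + 1


if __name__ == "__main__":
    print("CDS length consistent with protein:",
          coding_length_matches(TRANSCRIPT_BP, PROTEIN_AA))
    print("Genomic span (bp):", genomic_span(GENOMIC_START, GENOMIC_END))
```

Under these assumptions, 6810 bp corresponds to 2269 codons plus one stop codon, and the genomic interval is longer than the transcript, as would be expected if the gene contains introns.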
Annotation
Heliconius: HMEL010335, 0.0, 90.20%
Bombyx: BGIBMGA007906-TA, 0.0, 84.12%
Drosophila: poe-PA, 0.0, 48.32%
EBI UniRef50: UniRef50_F4WKS3, 0.0, 56.00%, Protein purity of essence n=6 Tax=Coelomata RepID=F4WKS3_ACREC
NCBI RefSeq: XP_972816.2, 0.0, 54.17%, PREDICTED: similar to f14p3.9 protein (auxin transport protein) [Tribolium castaneum]
NCBI nr blastp: gi|332024978, 0.0, 56.00%, Protein purity of essence [Acromyrmex echinatior]
NCBI nr blastx: gi|157117939, 0.0, 51.77%, f14p3.9 protein (auxin transport protein) [Aedes aegypti]
Group
KEGG pathway:
Orthology group: MCL10608, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213147-TA
ATGGCGTCCACGTCTAGAGCTGCGGAAGCCGAGAATGAGAGTCGTAGATTACACGCGCTCCGCCTGTCACTCTTATCAGCGGCGTTGGACACGATACCCACACTGAGGACACTACCCGGTGTACGCGCGATTCCCTTTATTCAGGTCGTCCTGATGCTAGCCGGTGATCTCGACTCCAGTATAGAAGCAGATCGTGCGGTGTTGGACCGTTTACTCGAAGTTCTAGTAGCCGAGTTGGACATACAGCCGGAAGAATCTGCCAGTGGTGAAGATCGCACTGAACGACGCGAGTTGCAGTTGGCAATCATGAGATTGGAACTGCTGGTTTCCGAGTTAGATGCACAGACAGAAGAACATTCACCTCCTGTACACGAACGGACTAACAGGAGGGAGCTACAACTTATAATCATGAGATTACTGAGTGTACTGATGGCTCGCTGGAAGAGCTGCGGAGGTTCGGGCGGCGTGAGCGGCAACGCGGTGGCGCGGGGCGAGGGGGGCGGCGCGAGTGCGGCACACGTGTCGCGTCTCGCCGCCGCCGCGCTCGTGCGGGCCGGAGCCCCCGCACACTGCTACAACGTCCTAGCCGCGCTGCTACCCTACTGGAAAGAGAAAACCAGCAACTCCAGCACAACAACGACCGTCCAGCCGCTACTGAAACCACAGCCACCACAACCACTACCTGACATGCAACCCTTCTTTGTCAAAGAGTACGTTAAAGGTCACGCTTTGGACGTTTTTGACAATTATCCACAACTGGTGATGGAAATGGCTCTTCGCTTGCCGTGTCAGATACACAAGCACTGCGATCCAAAACACTTCGATGCTCGCTGGCGCACCCTCCTCTGCGAGTACATGATGAACCAACAAACTCCACTCAGAAAACAAGTACGTAAACTACTGTTGCTTCTGTGCGGAACCCGAGAGCGATATAGACAACTTCGTGACGTACACGCTCTCGACACACATCTGAACGCGGCACGCGCCCTGCTGGCCACGGATCCTGCGTATGACAAGCTCGTTAGACTCATGGAACATTTGAAGGCATGCGCGGAGATAGTATCGGCTCGTACTGGCAACTGGCAGTTGACGTGCAGCAGCGAGCGGCGGGAGACGTTGTCGTGGTTGGTACGAGTAGCACGGCGCGTGCACCCTCACGTGGCGCCCACAGTACTACAGCTACTGCAGGCGGCTCTGTGTGCCCCCGCTGCACCCACCCCCTCCACGCCCGTGGGACCGCCCACTTCCGTCGCGAGCGCTGCTGCCTCGAATAACACAAAGAACACCTCTGAATGGCCAGAGCGAGAACGTAGCACCGAGAGCGACGCCTTCGTATCAGATGGCTCCAAATTCCAAGATCAAAGAGTGCCGCAACTTGTACAACAGATTTTGAAACAGGTCAGTCAAGATGAACTGAGACTGTTCGTAAAGGCATTCCTACTGGAAACGAATTCGACTGCCGTTCGCTGGCAAGCTCACGGTTTACTTCTCGCTATTTACAATAATTGCTCTCAAACCGAGCAGTCATCCATGGTGTCCCTGTTGTGGGGGTTGTGGCCTTCCCTGCCGCAGTACGGTCGCAAGGCAGCCCAATTCGTAGACCTACTTGGATATTTCACTCTGAAGACGCCCAATATTGACACTGAGAAGTACCTCGGTAGCGCCGTGGAACTGCTCCGCAATCAAAACTATCTACTGTCATCTCACGGTAACGCGTCCTTGTACGCCGCGCTGGGAGCGTTCGTTGAACTCGACGGATACTTCTTGGAGTCCGAGCCGTGCCTCGTGTGCAACAACCCGGAGGTGCCCATGGCCACCATCAAGCTACCCACAATTAAGATCGACTCGAAGTTCACGACCACGACTCAGATAGTGAAGCTCGTGAACAGCCACATGATCAGCCGCATCAGCCTCAGGATAGGAGACATCAAGCGCAGCAAGATGGTGAGGACCATCAACTTCTACTACAACAACAGGACGGTGCAGGCTGTGCAGGAGCT
CAAGAACAAACCGGGTATGTGGCACAAAGCCAAACGTGTGCAGCTGCAGTCCGGTCAATCCGAAGTCCGTGTAGACTTCCCCTTACCGATCGTCGCCTGTAACCTCATGATGGAGTACGCTGACTTCTATGAGAATCAGCAGGCTACGGGCGAATCACTGCAATGTCCGCGATGCTCTCAATCCGTGCCAGCGAACCCCGGAGTGTGCGCTAACTGCGGAGAAAATGTATTCCAGTGCCACAAGTGTCGCGCTATAAACTACGACGAGAAGGATCCCTTCCTGTGTCACGCGTGTGGGTTCTGCAAGTACGCCAAGTTCGACTACACGCTCACAGCGAGACCGTGCTGTGCCGTGGACACCATAGAGAACGACGAGGAACGGAAGAAGATGGTTCAGACTATCGGGGCACTGCTCGATAAAGCCGACCGCGTGTATAGACAGCTCATAGCTAACAAACCTGTTCTAGAGTCGCTAGTACACAAGATCAGCGAACATCGCGTAGAAGGGCGCGCGGATGAGAACAGCAACGGAGCGAGCGCCGCGTTCGGCGGAACACAGATAAACCGCGTCATACAGACGCTAGCGCACAAGTACTGCGTCGAGAGCAAAGGACACTTCGAGGACCTGTCCAAGATCATACAGAAAGTGCTGGCCTGCAGGAAAGAGCTCGTGGCTTACGATAGGACACAAAGCGAACAACAGAAAGGCGATACCCTACCCGTGTACGCCGGTCTCCTGCAAAACTATGACGGAGATGTTACAAAGGAATGTGGAGGTGGCTGCTACGGGTGCAGCGTGGCGTGCGCTGAACAGTGCCTGACGTTACTCCGAGCGCTGGCCTCGCAGCGAGAACACCGAGCGAGACTGTGTCGCTTCGGGCTAGTTCGCGAACTTGTCCAACATAACCTGCACCGCGGGACACCGCAGTGTCAAGAAGAAGTACGTGCACTCATCTGTCTTGTGACCCGAGATAATTTACCAGCAACTGAACAGTTGTGCAATTTGCTCTCTCAAAGGATAACACTCTCTTTGATGGGTCACGCAGCATCACAGGATCATAACAACTCGGTTCGACCGCTGGTTTTATTACTTGGGTCTTTGGTTAAGGTCCAAGACTCTATTGACTGCTGGGAGGTCAGATTGCGCTGTATAGTGAAGCTATGGGTTTGGTGCTGTCCCCAATTGACGGAGGTTGTCGGTATACCTCCCGCAGCGAGTGCAATTTTGAAGGCTGAAGCTCTAGCCGGTATTCCGGGAATAAACACCTCTTCGCCGAACTCACACCAATTCGCGCTACAACAAGTGGCGCTACCCTGTCTGCGGTATATGCAGGAGTTGATGGCGCCTCCACCTTCGGCGCTACCTCCGCCATCGGCGCCCACGAATGAAGAACAAGAAAAGGAAGTGACAAACAGTCTGCCCACCAGCGCCGGAAACATGGTGGTGGATCTATCAGCATGGCTGGCGGGGAAGGTGCCGCACGCGCAGTGGCGACGCCTGGTGGGAGCTGTCGAACCTCCGCCGAGCGATTCAGCGATACCAGCTCGCGATCTACACCTTGCACATAAATACCTCGGCAAGTGGAAAGAACGGATGTTGCTCTCCCATGGCATGCGACCTTTGGCCCTGGACGAGGGCGGGTGGCTGCGGCCCGTCATGTTCGACCCCAGCTCTCGCATCGCCAGGGACACAGCATGCCAGATGGTGAAGAGTTTATGCGATTCCTACGAAAGGACGAAGGCTGTCCTGATATTATTGACGAGCTTCCTTCCCGAGGTCGGAACTGCCGGAGAGGCGAGCGAACAGTTTCTGCAACTCTATCAGAGTCTGGCGTCCGAGGCTCCTTGGAAACAGTTCCTGGCTTTACGTGGAGTGCTGCAACAGATCGCCGACCTTATGACCAAAGAGATAGATCAACTGCATCGTCTCGAAGAAACCACGCTAACATCCGATCTCGCTCAAGGTTACGCTCTGAAGCGTCTGACGGAGCTGCTAGCGA
TGTTCCTGGAGGAGCCGGGCGCCAGACGTACTTACAAGGGCCGTCTTGTTGGGGCCGTGCTGGGCGGATATCTGTCACTCAGGAGACTTGTGGTACAACGCACCAGGCTCACGGACGACACGCAAGAGAAGCTTCTAGAGCTGTTGGAAGAAATGACTACCGGTACGGAGACAGAAACAGCCGAGTTTATGGCGGTGTGCATAGAGACGGTCCAGAAGTACCCGCTGCAGGACTATCGTACACCTGTGTTTATTTTCGAGAGGCTCTGTTCCATCATCTACCCGGAGGAGAACGATGTGGCAGAGTTCTTCCTCACACTGGAGAAAGATCCACAACAAGAAGACTTCTTACAGGGTCGCATGTTGGGCAACCCGTACTCGTCTTTAGAACCAGGAATGGGTCCTCTGATGAGAGATGTGAAGAATAAAATATGTACCGACTGCGAGTTGGTGGCACTCCTGGAAGACGACAACGGAATGGAACTACTCGTGTGCAACAAGATTATGTCTCTGGACTTACCGGTCAAAGAAGTATATAAGAAGGTGTGGTGTACATCTGGCGAGGAAGTGGACGCCATGCGGGTCGTGTACCGCATGAGGGGGCTGCTAGGAGACGCCACCGAGGAGTTCGTCGAGACGCTGAGCCAGACCAACGCTGAGACCGTCGACGACGAACAAGTATACCGCATGGCTAACGTGCTCGCTGACTGTGGCGGGCTGGAAGTCATGCTCCAGCGGCTGGCGGCCATAGGACGCGTGGGGTGCGCGAGGTCACTGGTGTCGACGCTGTTGCGTCTCCTGGCTTTGTGTGCTCGCGTGCGACGTTGCGTGCGCGTGCTGACCCGCCCCGAGGCGCGCGCTCTGCCCGTTCTATTGCACGCACTCAGCCTGGCCGCCGCAGACGAGCGAGACGTTCAACGCGCCCCGCTCGTCTATCAGCTGTTGGAGATAATGGAACGTATTCTATCGGTGGCCGCGAGCGAAAGTCTCGAATCTTTTCTACAATTTTCGCTCACCTTCGGCGGACCGGAGCACGTTCAGGCTTTGCTCAACTGCACGGAATGTCCAGGTATCCGCAACAACTCTGTGGCCCTGGGTCACCTGACTCGTGTTCTGGCCGCGCTAGTTTACGGCAACGATCTCAAGATGGCGATGCTAGTGGACCACTTCAAGCCCGTCCTGGACTTCGATCGCCTGGACTCGGAGCAGTGGAGCGAGGAGGAGTTCCGTATGGAGCTGTTCTGTGTCCTGTGCGCTAACATAGAACGGAACTCTATCGGTGGAACCCTCAAGGACTACTTGATATCGCTCGGCGTTGTTCGCGACGCGCTCGAGTATATTGTGAAACACGCTCCGTGTGTGAAGCCAACGTTAGTCTTAACCGACTCTGACGAACTGAAGGAATTCATAAGTCGCCCGGCGCTCAAATACATCTTGCGCTTCCTCACCGGACTCGCCGCGGACCATGAACCCACTCAGATGCTGGTGTGCGAGAAAGTTATTCCGATCGTGCACCGTCTGGAACAAGTGTCGTCGGGGGAGCACGTGGGCTCGCTGGCCGAGAACTTGCTGGAGGCGCTACGATCGCAACCGCAGTGTGCGGCCAAGGTGCAGCAAGTCAGAGAGTTCACGAGACAGGAGAAGAAGCGTTTGGCGATGGCGGTTCGCGAGCGGCAGCTGGGCGCGCTCGGCATGCGCAGCAACGAGCGGGGCCAGGTGACGGCGCAGTGCTCGCTCACACAGCAGGTGGCGGACCTGGCGGAGGAGGCCGGGGCCGTCTGCTGCATCTGTAGGGAGGGATACAAGTACCAGCCGACTAAGGCAAGTACACAAGTGTTAGGTATCTACACGTTCACGAAGCGCTGTGCGGTGGAGGAGTACGAGGTCCGCGCTCGCAAGACGCTCGGCTACACCACCGTGTCCCACTACAACATAGTACACGTGGAGTGCCACATGGCGGCCGTGAGGCTAGCACGCGCCAGGGACGAGTGGGAGAGCGCC
GCCCTACAGAACGCGTCGACTCGTTGCAACGGTCTGTTGCCGCTGTGGGGTCCGCATGTCCCGGAGGCAGCCTTCGCCTCGTGTCTGGCCAGACATACCACCTACTTGCAGGAGTGTACCGGCCACCGCGACATCGGCCACGCGTGCACCGTCCACGACCTGAAGCTGCTGCTGGTGAGGTTCGCCCGCGGACGGACCTTCCACGACGACACCGGCGGCGGCGGCCCGCTGTCCAACATGCAGCTGGTGCCCGCGCTCATACACATGGCGCTATACGTTATTAACACTACCCGGGTGGCTGGTCGCGAGGTGTCGTCCCTGGACGCGAGCGTGTCGTGGGCGGCCGGCCGAGCGCTGGAGGCGGCTCACGAGGCGGACGGCCCGCTGTATTACCTCACACTCATGCTGTTACTGTATCCGATCAACAGGCGAGCATGGCGCTCCCTCCGCTTGGACATGTTAAAGCGCCTGCTAGTGACTGGTCACGTGCGCGCCGTGTGTCCCGGGGGACCCGCGCTGAGGGCTCTGTCCGCGGAACAGAGAGCCGCTCGACCCTGGACAGACTACAAGCCGTACGCCATCTTCAACGTGAACGCGACGACAGTGGAACAGTGGCCCATCAGGCTGGCGGAGTACATCCGGCACAACGACGAGGCCAACGCGAAGGCGGCCGAGCGCATCGTGACGACCCTCACGGACGAGCTGCTGCCGTGCGCCTCCTTCGCCGAGTTCTGCGACGCGGCGGGCTTCCTGGACGACATCCCCGACCCCGACTCCTTCCTCCAGACCCTCATAGACGAGCAGCCGTGA

Protein sequence:

>DPOGS213147-PA
MASTSRAAEAENESRRLHALRLSLLSAALDTIPTLRTLPGVRAIPFIQVVLMLAGDLDSSIEADRAVLDRLLEVLVAELDIQPEESASGEDRTERRELQLAIMRLELLVSELDAQTEEHSPPVHERTNRRELQLIIMRLLSVLMARWKSCGGSGGVSGNAVARGEGGGASAAHVSRLAAAALVRAGAPAHCYNVLAALLPYWKEKTSNSSTTTTVQPLLKPQPPQPLPDMQPFFVKEYVKGHALDVFDNYPQLVMEMALRLPCQIHKHCDPKHFDARWRTLLCEYMMNQQTPLRKQVRKLLLLLCGTRERYRQLRDVHALDTHLNAARALLATDPAYDKLVRLMEHLKACAEIVSARTGNWQLTCSSERRETLSWLVRVARRVHPHVAPTVLQLLQAALCAPAAPTPSTPVGPPTSVASAAASNNTKNTSEWPERERSTESDAFVSDGSKFQDQRVPQLVQQILKQVSQDELRLFVKAFLLETNSTAVRWQAHGLLLAIYNNCSQTEQSSMVSLLWGLWPSLPQYGRKAAQFVDLLGYFTLKTPNIDTEKYLGSAVELLRNQNYLLSSHGNASLYAALGAFVELDGYFLESEPCLVCNNPEVPMATIKLPTIKIDSKFTTTTQIVKLVNSHMISRISLRIGDIKRSKMVRTINFYYNNRTVQAVQELKNKPGMWHKAKRVQLQSGQSEVRVDFPLPIVACNLMMEYADFYENQQATGESLQCPRCSQSVPANPGVCANCGENVFQCHKCRAINYDEKDPFLCHACGFCKYAKFDYTLTARPCCAVDTIENDEERKKMVQTIGALLDKADRVYRQLIANKPVLESLVHKISEHRVEGRADENSNGASAAFGGTQINRVIQTLAHKYCVESKGHFEDLSKIIQKVLACRKELVAYDRTQSEQQKGDTLPVYAGLLQNYDGDVTKECGGGCYGCSVACAEQCLTLLRALASQREHRARLCRFGLVRELVQHNLHRGTPQCQEEVRALICLVTRDNLPATEQLCNLLSQRITLSLMGHAASQDHNNSVRPLVLLLGSLVKVQDSIDCWEVRLRCIVKLWVWCCPQLTEVVGIPPAASAILKAEALAGIPGINTSSPNSHQFALQQVALPCLRYMQELMAPPPSALPPPSAPTNEEQEKEVTNSLPTSAGNMVVDLSAWLAGKVPHAQWRRLVGAVEPPPSDSAIPARDLHLAHKYLGKWKERMLLSHGMRPLALDEGGWLRPVMFDPSSRIARDTACQMVKSLCDSYERTKAVLILLTSFLPEVGTAGEASEQFLQLYQSLASEAPWKQFLALRGVLQQIADLMTKEIDQLHRLEETTLTSDLAQGYALKRLTELLAMFLEEPGARRTYKGRLVGAVLGGYLSLRRLVVQRTRLTDDTQEKLLELLEEMTTGTETETAEFMAVCIETVQKYPLQDYRTPVFIFERLCSIIYPEENDVAEFFLTLEKDPQQEDFLQGRMLGNPYSSLEPGMGPLMRDVKNKICTDCELVALLEDDNGMELLVCNKIMSLDLPVKEVYKKVWCTSGEEVDAMRVVYRMRGLLGDATEEFVETLSQTNAETVDDEQVYRMANVLADCGGLEVMLQRLAAIGRVGCARSLVSTLLRLLALCARVRRCVRVLTRPEARALPVLLHALSLAAADERDVQRAPLVYQLLEIMERILSVAASESLESFLQFSLTFGGPEHVQALLNCTECPGIRNNSVALGHLTRVLAALVYGNDLKMAMLVDHFKPVLDFDRLDSEQWSEEEFRMELFCVLCANIERNSIGGTLKDYLISLGVVRDALEYIVKHAPCVKPTLVLTDSDELKEFISRPALKYILRFLTGLAADHEPTQMLVCEKVIPIVHRLEQVSSGEHVGSLAENLLEALRSQPQCAAKVQQVREFTRQEKKRLAMAVRERQLGALGMRSNERGQVTAQCSLTQQVADLAEEAGAVCCICREGYKYQPTKASTQVLGIYTFTKRCAVEEYEVRARKTLGYTTVSHYNIVHVECHMAAVRLARARDEWESA
ALQNASTRCNGLLPLWGPHVPEAAFASCLARHTTYLQECTGHRDIGHACTVHDLKLLLVRFARGRTFHDDTGGGGPLSNMQLVPALIHMALYVINTTRVAGREVSSLDASVSWAAGRALEAAHEADGPLYYLTLMLLLYPINRRAWRSLRLDMLKRLLVTGHVRAVCPGGPALRALSAEQRAARPWTDYKPYAIFNVNATTVEQWPIRLAEYIRHNDEANAKAAERIVTTLTDELLPCASFAEFCDAAGFLDDIPDPDSFLQTLIDEQP-