Monarch geneset OGS2.0

DPOGS203584
TranscriptDPOGS203584-TA4395 bp
ProteinDPOGS203584-PA1464 aa
Genomic positionDPSCF300063 - 1091466-1100378
RNAseq coverage856x (Rank: top 15%)
Annotation
HeliconiusHMEL0158680.078.35% 
BombyxBGIBMGA001380-TA7e-15068.04% 
Drosophilachb-PB0.036.47% 
EBI UniRef50UniRef50_Q7QJR30.039.48%AGAP007623-PA n=4 Tax=Culicidae RepID=Q7QJR3_ANOGA
NCBI RefSeqXP_308248.40.039.48%AGAP007623-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582853310.039.48%AGAP007623-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582853310.039.19%AGAP007623-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00054885.3e-40binding
KEGG pathway 
InterPro domain[8-990] IPR0160245.3e-40Armadillo-type fold
[8-221] IPR0119896.5e-22Armadillo-like helical
Orthology groupMCL11440 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203584-TA
ATGTCTCACTATCCCCAACCGTCCTCCCTGGAGGCCGCCCTGCCGCTGCTGTCTCGGCCGGACCTGCGTCTCCGGCAGCAGCTGGGGGAGCGTCTGGTGAGCCTGGTGCGCTCGGAGGAGCTGACCCCCGACCTGAATCCCGATCTGACCGGACCTCTGCTTGACGCGCTCGTCGGTTGGCTCAACGGAGGGAACTTTAAGGTCGCCCAGAACGGTCTGGAGGTGATGTCGGCCCTCGTTGAACGTATGGGCCCCGAGTTCTCACACTACGTGCCCACAGTGCTGCCTCACATCATAGACAGGCTGGCGGACACCAAGGAGGGCGTGCGGGTGTCGGCCCGCGCCTGCATAGCGACCCTCAGCTCGTGCAGGGCGGCGGCGCCCAGGGCCATACTCGCGAGACTCACTCCAGCGCTGGGGCACAAGGCGGCGCACACCCGGGAAGAGGCGCTCACCTGCATAGGAACGCTGCTGCATGAACACGGCGCAGCGGAGTTACAGCTGCGGGGTGCGGTCCCGCAGGTGGCGGCGCTTCTAGGAGACCCCAGCGGGGCCGTCAGGGACGCCGCGCTCGAACTCATCGTGGACGTCTACAGACACGTCGGGGAGAGGCTCCGGCAGGACCTGCGGAGGAAGGACCTCGTGCCGCAACAGAAGATGGCGCTGCTGGAACACAGGTTCGACGAGGCAAAGGAGGCCGGCCTGCTGCTGCCCTCGGCGTTGGGCGCGGACGAGGCGGACTGCGCTCCTCGTGTGAAGCGCGCCCTCACGCTGCCCACGCCGAGAGGACGAGAAGATACTTCAGGATCCAGTACACCCGCGGGTGAGCCGAAGCGTACAGCGGCGGGTCTGTACAGCCTGCCGTCGGCGAGCAGGAAGCCGCCGCCGCCCACCAAGCTGAACAGCGCGCAGAGCGCGAGCGGAAGCGGAGGCGGAGGGGTGGGCGGCGAGGCGGGCGCGGTGTCCAACGAGTCGTTCGAGGCGGCGTTCGCGAGCACGGCCCCCGCCGCCGTGTACGGAGCCCGCGGCCTGGACGACCTGTGCCGCCACGCCGCCGCCCTGCTCGGGGACCGCGCCGCCGACTGGGAGAAGAGGGTCGACGCTTTGAAGAAGATCCGCTCGTTGTTAGCGGCGAACGCACACGTTCAGTTCCCGTCGGAGTTCGCGGCGCACCTCAAGGACCTGTCGGTGCCGTTCCTGGTGGTCATCAAGGATCTGCGCAGCCAGGTGGTGAGGGAGGCCTGCATCACCATCGCCTACATGGCCAAGGTGCTGCGGAACAAGCTGGACCAGTTCAGTCTCTACATACTACAGGAACTCATCAACCTCATACAGAACGCCGCCAAGGTGGTGTCGTCGGCGGGCACGGTGTGCGTGCGGTACATCGTGCAGCACGTGCCGGCGCCGCGCCTGCTGCCGGTGCTGGTCACCAACCTTACCACTCACAAGAGCAAGGAGATCCGCGCCACGCTCAGCGAGGTGCTGCTGCTGCTGCTGCGCTCGTGGCCGCGGCCGGCGCTGGACCGGCACCAGGCGGCCATCGCGGACGCCATACGAAAAGCCTGCGCCGACGCCGACAGCACGGCCAGGAACAACGGCAGGAAAGCGTTCTGGTCGTACAAGAGTCAGTTCCCGGAGCAGGCGGAGGCGCTGTTCAGTCGCATGGACGTGGCCGCGCAGAAACAGCTGGAGAGAGACAAGGCCGGCTCCATGGACGGACTGCAGCAGTTGATACCCGAGAGAACTAGAACCATCACCAGTCCGCGGAGTCCTTCGGCCAGCGTGTCCGCGTCCACTGAGAGCCTGGTGTCGGTGGTGAGTCGCTCGGGTTCTCTCCGCCGGCGCCGCTCGTCCCAGGAGCGTCCGCCCATCTCTCACATCCCGGTGGCCATGCGAGATCGTTCGCCGGCCACCGGCTGCGTGTCGTCCCGGTCCGTGTCGGCGGTGGACGCGGCGGCCGCCCAGCGAGCCCGCGCCCGAGCTGTTTACTCGCACCTCGCTAGGACTAAAGTCGCCGCCGGCAGCGCCAGTCTGCCCCGCGTCAAACGTTCCCCGGTGGCGGCGGTGCCCCCCAGCCCTGAGCGGAGCGTGCGGTCCAGGTCGCGCCCCGGAGTATCGCAGTCACAACCGACGTCCCGGTCGTCGTCTCCGTCGTCTCGCAGCGGTCCCGTGTCGCTGTCGTGTCGCCGCCGTCCGTCCGGGATCCCCAGGTCGCTGGCGGGGTCTCGAGAGACCAGTCCGACCCGCACCCGCTCCGGCAGCCTCGCTCGTCGCCGGGACTCCGTGGACCGCCGCCCCCCCGCCGCGACGCTGCGACTCCTGCAACAGTCCAGAGACGCCGAGAACGCGCTGCGGAGTCCCGAGGACAGCTCGTCGTGTGAGGTGCGGCGAGACGATGACTCGGAGGCGTCCTCCGTGTGTAGCGAGCGGAGTCTGGACTCGTACAGGAGACACGACAGCGTGTCCTGGTCGGGGTCCAGCCGCCTGGTATGGGAGGGGTCTCCGCCGCCCCCGCCTCCCGCCGATGACGTCATAGCACTGTGCGCCTCGACACACTGGACCGAGAGGAAAGACGGACTGACGCATCTAGCGAACTATCTGAACAGCGGGCGACTGCTGACGGAGGACCAGCTGAAGAGGCTGACGGACTTACTCAACAAGATGTTCGTAGACGCTCACACAAAGGTGTTCTCCCTGCTGCTGGACGCCGTCTGTGAGCTGCTGCTGGTGCACTGGCAGCAGCTCAGGGACTGGCTCTACCAGCTCATGTTCAGATTACTGATGAAGCTCGGCACAGACATCCTGGGCTCCGTGCAGAGTAAGATCATGAAGACGCTGGACGTCATACACGAGTGCTTCCCCGCGGAGCTGCAGCTGCATAACATATTCAGGTTCCTCGCGGACGGAGCGACGGCCCCCACCGCCAAGACGAAGGCGGCCGCGCTGCGCTTCCTGGCCGACCTCGCACACGACTACTGCACGCCCGCGGGCCTCGCCGCCGCCTTACACGGCGGCTCGTCCGGCGTGGCGGGTCGAGCTCTAACGAAGGTGGCGGTCCTGGCGGGAGACGCCCGCGCGGGCGACGTGCGCCACTGCGCCCGGCGCGCCCTCGCCGCTCTGTATGATTGTAACCCCGGCCCCTTCACGACTCTCATGTCGGAGCTGTCGGCCGACACGCAGGCGCTCGTCAACGGGGTGGTGCAGCAACACGTCAGGAGGACCTCCAGCACCGGCAGCGACAGTCCGCTGAGGTCGACGGCCGCGCCCGACGACGTCTACAGCCGGATACGGAAGACGACCAGCGAGATACACACGTACACCACGCAACACGCGAACGCGGAGTGCGCGAGCCACGCCTGCAGCAAGGACTCGGGGATCAGTCAGATGTCGGAGCGGCACAACGGGCACGCGCATGAGGCGGTGGCTCGAGCGGCGTCCGCGGACAGCTCGGAGGCGAGCAGCACCAAGGAGTCCTCGCCGGGCCCGCACCGACCCGACTATCACGGAGAATACAACGCGACCGCCGGAAACAACAACAGAGACAAGATGAAGCCCTACGAGATGGACGAGAACGGGATGATCATCACCAAGTCCGGACTCCGCGAGAGTGAGGTGCTGGAGGCGCTGTCCTCCCTAGACGTGGCGGCGGCGGCCCCGGAACACACGGAGCGTTTGCTGCTGGCCACGCACGAGGTGCTCAAGTACGGAGACTGCCGGCTGCCGCTGGAGTACTTCAAGAACATCGTCCGCGCCGCGCTCGCCGCTCTCTCCATCGACGACAACTCGGCAGACAAGGAGAACGCCGAGAATGCCACCAACACACAGCACGCCTCAGGGTGGGGCACGGCCCAGGAGCGGGCGGCGGCCGAGGCGGTGCGCGTGCTGGTGTGGCTGTGTCGGCGGACGGAGACGCGTGCGCTGTGGGCGGAGTACTTCGACCTCATCCTGCTCAAGCTGATCAACGCGTACGGAGCCTCCAGCAAGGAGGTCATGAGGGCCGTGGACGCGGGCATGACGCACATCGCACACGCGCTGCCGGCGGCACAGGTGCTGGCGCTCCTGAAGCCCGTGATCCGGACCCGCGGGTACCCCACGTCTCTGTGCGCTCTCAAGCTGGCGGCCGAGGTGGCGAAGGCTCGAGGAGACGAACTGACGGACGAGACAGTGGCGCAACTCATGGAGGGAGTCGGGCAGCTGGCCGACCACCAGAACTCTGCGGTGCGCAAGGCGGCCGTGTTCTGTATGGTGGCCTTCACGTGCGCTCTCGGCGACGAGCGGATGACGCCCCACCTGAAGCACCTGTCCGTCAGCAAGTACCGCCTCCTGCAGGTTTACATCAGTAAGCAGCGCGAGGAGTCCTCTCGGCCCCCTCCACCCTCCTCCACACACTCGTAG

Protein sequence:

>DPOGS203584-PA
MSHYPQPSSLEAALPLLSRPDLRLRQQLGERLVSLVRSEELTPDLNPDLTGPLLDALVGWLNGGNFKVAQNGLEVMSALVERMGPEFSHYVPTVLPHIIDRLADTKEGVRVSARACIATLSSCRAAAPRAILARLTPALGHKAAHTREEALTCIGTLLHEHGAAELQLRGAVPQVAALLGDPSGAVRDAALELIVDVYRHVGERLRQDLRRKDLVPQQKMALLEHRFDEAKEAGLLLPSALGADEADCAPRVKRALTLPTPRGREDTSGSSTPAGEPKRTAAGLYSLPSASRKPPPPTKLNSAQSASGSGGGGVGGEAGAVSNESFEAAFASTAPAAVYGARGLDDLCRHAAALLGDRAADWEKRVDALKKIRSLLAANAHVQFPSEFAAHLKDLSVPFLVVIKDLRSQVVREACITIAYMAKVLRNKLDQFSLYILQELINLIQNAAKVVSSAGTVCVRYIVQHVPAPRLLPVLVTNLTTHKSKEIRATLSEVLLLLLRSWPRPALDRHQAAIADAIRKACADADSTARNNGRKAFWSYKSQFPEQAEALFSRMDVAAQKQLERDKAGSMDGLQQLIPERTRTITSPRSPSASVSASTESLVSVVSRSGSLRRRRSSQERPPISHIPVAMRDRSPATGCVSSRSVSAVDAAAAQRARARAVYSHLARTKVAAGSASLPRVKRSPVAAVPPSPERSVRSRSRPGVSQSQPTSRSSSPSSRSGPVSLSCRRRPSGIPRSLAGSRETSPTRTRSGSLARRRDSVDRRPPAATLRLLQQSRDAENALRSPEDSSSCEVRRDDDSEASSVCSERSLDSYRRHDSVSWSGSSRLVWEGSPPPPPPADDVIALCASTHWTERKDGLTHLANYLNSGRLLTEDQLKRLTDLLNKMFVDAHTKVFSLLLDAVCELLLVHWQQLRDWLYQLMFRLLMKLGTDILGSVQSKIMKTLDVIHECFPAELQLHNIFRFLADGATAPTAKTKAAALRFLADLAHDYCTPAGLAAALHGGSSGVAGRALTKVAVLAGDARAGDVRHCARRALAALYDCNPGPFTTLMSELSADTQALVNGVVQQHVRRTSSTGSDSPLRSTAAPDDVYSRIRKTTSEIHTYTTQHANAECASHACSKDSGISQMSERHNGHAHEAVARAASADSSEASSTKESSPGPHRPDYHGEYNATAGNNNRDKMKPYEMDENGMIITKSGLRESEVLEALSSLDVAAAAPEHTERLLLATHEVLKYGDCRLPLEYFKNIVRAALAALSIDDNSADKENAENATNTQHASGWGTAQERAAAEAVRVLVWLCRRTETRALWAEYFDLILLKLINAYGASSKEVMRAVDAGMTHIAHALPAAQVLALLKPVIRTRGYPTSLCALKLAAEVAKARGDELTDETVAQLMEGVGQLADHQNSAVRKAAVFCMVAFTCALGDERMTPHLKHLSVSKYRLLQVYISKQREESSRPPPPSSTHS-