Monarch geneset OGS2.0

DPOGS213316
Transcript: DPOGS213316-TA (2484 bp)
Protein: DPOGS213316-PA (827 aa)
Genomic position: DPSCF300516 + 25369-36943
RNAseq coverage: 643x (Rank: top 20%)
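The lengths in this summary can be cross-checked with a little arithmetic. What follows is a minimal sketch, assuming the genomic coordinates are 1-based and inclusive (the record does not say) and that the transcript length counts the stop codon. The function names `parse_position` and `genomic_span` are hypothetical and not part of any geneset tooling.

```python
import re

def parse_position(text):
    """Split a 'SCAFFOLD STRAND START-END' string into its four parts."""
    m = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognized position: {text!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

def genomic_span(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

scaffold, strand, start, end = parse_position("DPSCF300516 + 25369-36943")
span = genomic_span(start, end)

transcript_bp = 2484
protein_aa = 827
# Assumption: the transcript length is the coding sequence including the stop codon.
codes_with_stop = transcript_bp == 3 * (protein_aa + 1)

print(scaffold, strand, span, codes_with_stop)
```

Under these assumptions the locus spans 11575 bp on the scaffold, considerably longer than the 2484 bp transcript, which would be consistent with the gene containing introns.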
Annotation
Heliconius: HMEL004737; 7e-129; 75.07%
Bombyx: BGIBMGA010860-TA; 2e-09; 28.86%
Drosophila: lilli-PA; 1e-27; 29.51%
EBI UniRef50: UniRef50_E2BGT0; 1e-42; 40.13%; AF4/FMR2 family member 3 n=4 Tax=Formicidae RepID=E2BGT0_HARSA
NCBI RefSeq: XP_001120047.1; 9e-47; 41.36%; PREDICTED: similar to lilliputian CG8817-PB, isoform B [Apis mellifera]
NCBI nr blastp: gi|328789571; 2e-45; 41.22%; PREDICTED: hypothetical protein LOC724254 [Apis mellifera]
NCBI nr blastx: gi|383853485; 7e-55; 27.73%; PREDICTED: uncharacterized protein LOC100877505 [Megachile rotundata]
Group
KEGG pathway 
InterPro domain: [38-828] IPR007797; 8.5e-43; Transcription factor AF4/FMR2
Orthology group: MCL21950; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213316-TA
ATGTTGGGATTCCTTACAACCGGCGGCTTTGGATACCAGAACCGGCTGACATTGCTTCAGCAGCTGATAGAATGTGGTGATGTTCTACTCGGAAGGGAATTCGTTCGTGGTTTGGACAATCGGATATCAAACTCAAGGAGTTCAGTATCGCCTGACGTGGTGAACAAAGATCTCGGTCTGTCAGAGAGCGATGATGAAGTAGCCGCAACCACCACAAGGCTTGAACCCATCCTCTCTCCTATCGGTTCCGGTGGTTCTCCGAGTTCTGGGAGTTCGTCTCCATCAGATTCTGAATCGGAGTCATCATCAGCGGAGTCCAGCCCGGCGGTGCCTCCGCCCGCACCCGTCCCAGCACCACAGAGGTCTTCTTGGAGCCTGTCGAACTTCGCCCCACCACCGCCACCCGACCACGCGGATCTAAGTAATGTTTTGGCCGACGCTAAAGGGAAACCGGTTCGAACCCAAACCTCGCCCACCATAGCTCAGAACACGGGTGTGAAACGCGGCAGGCCCCCCAAAGCCCGGACTTCGGAAGAAAGGCCGGCAAAGAAAAAGCGTGGCAGACCTCCGAAAACGAGGCCCCCATCACCGGCCCCGGACAGTGAGGCTGAGGCTGACCCCCCGCAACACCTCCAGGAAACCAAGACCCATATATTCCGCAAGGTTTTCACGCCCAAGAAGACGGACGATGGATGTGGTGGAAAAGGTGGGAAAGGAGGAAAAGGGAAAGGGAGTAAGGGAAAGAGTCAGGTGACCATCATAGAGGTGACGGCTCCTGAACTAGGTACTGGCGGTGAAGATAGGAGACACAGTGAGAGATCAGCGGAACGGAGGAACGAAGAGGCGATCGCCAGAGTATCACCGCCACGGGAACCGGAAACACTCACACGGAAACGGGAAGCGGATAGAATGGCCAACGAGAGGATACAGGAAAGAGTCTCTATAGAAAGGCACGACCGAATACAAGAGAGACCAGACAGAACCCCCATCGACCGAATACTGGACAGAACCGAGCGGATACCGGAGAGACGGTCCAGTGACCGATTACCACCCGACAGAATATCCAACGATCGGATATCTTCAGATCGATTGTCGGATCGATCGTCCCTTAACAACTCGGAAAGGAAATCAACAGATCGGTTATCGACTGACAAAATGTTGATCGATCGGATACACGGCGACCGGATTTCGGTCGACAAAATGTCTCACGACCGGCTATCCGGTGATAAACTCTCTCACGATAGGTTGTCGGGCGACAAGTTGTCGCACGACCGACTATCGAACGACAAGATGTCGCATGACAAATTATCAATAGACCGGTTATCAGGGGATAAGTTACCTTTAGATAGAGTCGAAAGGGTCGCACCAGATCGCTATATAAGGGCGGAGGCTGACGTCGGTTTCGTACAAAACTCATTACCGAACGGTTCAGAGCGTCGTCGTTCGCGGTCGTCTCGTCTAACAGTGTCCATACCTCTGGCAAGATTGAGAGTTGAAACACTCCGAGTATTACAACGACCGCAAAGAAATGCCCCAGCGCCCACGCCGCCGACGCCGCCCACACCGCCCACGCCGCCCACTGACCACACGCTAGAGGGCGGTAGTAGCGGCGTTGAACGCGCTGGTCGTACTCCTATATACTATTCATACTTTGAAAGACTACCAGCCGACGCCCTGTCTGACGAGGAAAGGGACCATAAATATTACCTCGGCGAAGCCAAACGTCTGCGCACAGAGGCCGAGAGGGAACAAGAGCCCCTCGCGCGAGTGATGCTGTATTTAGAATCAGTGCTGTGCTTCGTCCTGACCGGTCGAGTGCTGGAACTAGAACTCGATACTAAACGAGCCTTCACTATATATCGAGAGACTATCGAATACATTAAATCGATACACTCGATGCCGCAGCGGTTCAGAGCCTCCCCACATTCAACTTTCAGCAAGTTGGATATCTTGAGTTTGCGGGTACAAGCGTTGCTGTACTTGCGCATGTTCAAGATGTA
CAGTCGGGAGGTGAAGGAATACAACAAGATTGTACAGGAGTATCAGCAGAAGCCGGCGTGTGCGGAGGCCGTGTCCGGTGGCGCGGGTGTGACCGTATCGTCGGCGCGCTCGGCGTCCCCTCTCTCGCCGTCTCCGTCGCCCGCGGGGTCGGTGGGCTCGGGCTCCGGCTCGGGCTCGGGCTCCGGTTCGGGCTCGTCGGGCTACTGCTCGCTAGCCCCCGGGGCTTACTGTACACTCGCCCACGCGGTGCCCGCCCACGCCCACCACGCTCTGCTACAACTGACCAAGTATTACACGTTCCTGTACGTAGCACACGACCTATGGGAACAGGCCGACTGTCTGTGTCGACTGAGACCCAATCAAGATCTATTCATCGCCGTGGACCGTAAGTGCGGGCCGCTAACTTTGTTTTCTACGTTCCGTCATCTAGTGCAGTACGTGAGGCACGCTATATCGCTTCTCAAGAACGCCGCCAGGGAGTGA

Protein sequence:

>DPOGS213316-PA
MLGFLTTGGFGYQNRLTLLQQLIECGDVLLGREFVRGLDNRISNSRSSVSPDVVNKDLGLSESDDEVAATTTRLEPILSPIGSGGSPSSGSSSPSDSESESSSAESSPAVPPPAPVPAPQRSSWSLSNFAPPPPPDHADLSNVLADAKGKPVRTQTSPTIAQNTGVKRGRPPKARTSEERPAKKKRGRPPKTRPPSPAPDSEAEADPPQHLQETKTHIFRKVFTPKKTDDGCGGKGGKGGKGKGSKGKSQVTIIEVTAPELGTGGEDRRHSERSAERRNEEAIARVSPPREPETLTRKREADRMANERIQERVSIERHDRIQERPDRTPIDRILDRTERIPERRSSDRLPPDRISNDRISSDRLSDRSSLNNSERKSTDRLSTDKMLIDRIHGDRISVDKMSHDRLSGDKLSHDRLSGDKLSHDRLSNDKMSHDKLSIDRLSGDKLPLDRVERVAPDRYIRAEADVGFVQNSLPNGSERRRSRSSRLTVSIPLARLRVETLRVLQRPQRNAPAPTPPTPPTPPTPPTDHTLEGGSSGVERAGRTPIYYSYFERLPADALSDEERDHKYYLGEAKRLRTEAEREQEPLARVMLYLESVLCFVLTGRVLELELDTKRAFTIYRETIEYIKSIHSMPQRFRASPHSTFSKLDILSLRVQALLYLRMFKMYSREVKEYNKIVQEYQQKPACAEAVSGGAGVTVSSARSASPLSPSPSPAGSVGSGSGSGSGSGSGSSGYCSLAPGAYCTLAHAVPAHAHHALLQLTKYYTFLYVAHDLWEQADCLCRLRPNQDLFIAVDRKCGPLTLFSTFRHLVQYVRHAISLLKNAARE-
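
The protein record can be checked against the transcript by translating the coding sequence. The following is a minimal sketch, assuming the standard genetic code (the record does not say which translation table was used) and following this record's convention of writing the stop codon as `-`. The helper `translate` is hypothetical and not part of any geneset tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop="-"):
    """Translate a coding sequence, writing stop codons as `stop`."""
    # Drop line breaks and spaces so wrapped FASTA sequence lines can be passed as-is.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop if aa == "*" else aa)
    return "".join(protein)

# First 21 and last 15 nucleotides of DPOGS213316-TA.
print(translate("ATGTTGGGATTCCTTACAACC"))  # MLGFLTT
print(translate("GCCGCCAGGGAGTGA"))        # AARE-
```

Both fragments agree with the start (`MLGFLTT`) and end (`AARE-`) of DPOGS213316-PA. Passing the full transcript to the same function would allow a complete comparison.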