Monarch geneset OGS2.0

DPOGS201371
TranscriptDPOGS201371-TA1566 bp
ProteinDPOGS201371-PA521 aa
Genomic positionDPSCF300083 - 106474-118525
RNAseq coverage491x (Rank: top 25%)
Annotation
HeliconiusHMEL0084651e-13168.51% 
BombyxBGIBMGA000534-TA3e-3591.67% 
DrosophilaCG8949-PA2e-3159.79% 
EBI UniRef50UniRef50_D6WGL51e-6439.73%Putative uncharacterized protein n=2 Tax=Endopterygota RepID=D6WGL5_TRICA
NCBI RefSeqXP_396987.39e-7038.51%PREDICTED: similar to CG8949-PA [Apis mellifera]
NCBI nr blastpgi|3504061373e-7039.52%PREDICTED: hypothetical protein LOC100742256 [Bombus impatiens]
NCBI nr blastxgi|3504061375e-8539.27%PREDICTED: hypothetical protein LOC100742256 [Bombus impatiens]
Group
Gene OntologyGO:00055153.7e-09protein binding
KEGG pathway 
InterPro domain[174-200] IPR0012023.7e-09WW/Rsp5/WWP
Orthology groupMCL17105 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201371-TA
ATGGTAATGCATGCAAGGAAACCTCAGCGAATAAGCGATGGGTACTTTGAGAAGCACCAGACCCATCCGTATCAGAAGTACAATTCCAAGAGAATTGCTAATGACTACAGTACTTCTGACAACCGCTACACACCACTGCGATCCCCGAATGGTAATGTGTCAGCGCATGGCCATGGTGCGCTTGCGGGCCATGGTGGTCACACGGGCCATTCGGGCCACGGCCACCACCACTACAACCATATGCTTGACGACAGGGGCTATCCAATATCACGCAACTCCTACATTCAAAAAGGTTCAGACAAGGAGAGAGATAGAGACTACAAATCCTGTAGAAATAAGTATACAGATGTCAGATCACCTAAGGAGAAACGTAACAAAGACAGTGAGAGGGAGAAAAATAACTATGAAAGATGTGAATCGGAGAAAAAGAGTAGGACTAGTGGCATGAGTAGTAGTGTTAATAGAAGTAGTAAATATGACAAAAGAAGTGCTCCTACTTCAAGCGTGCCTTCGGGCGGTGAATGGTCAGAACACATTAGTTCATCAGGAAAGAAGTATTATTATAACTCTATCAGTGAAGTGTCACAGTGGGAAAAGCCTAGGGAATGGGACTCAAGACGGACCTCATCCAAAGATTCTACATATTCATCAAGAAGCAATCGCGAGAAGAGATCCAGCTCACGGTCGGGGCGTCGCGAGCCGGAGAAGACAAACAAACGATCCAACAGCACGACAGAAAGATACTGGAGCAGCAGAGAGGATGACTTACACGAACGAGGACGGACGAAGCACTCCTCGTCACAACACGACGGGAAAGAGGGTCAATCGCTGCAGGATATGGATATCTCACCGGACCGAAGCACGCCCCTGTCGGAGAGTTCTCACGGAGCGCGTGAACCGCCGAGGGACGGTGGAGTGCCTCCCGTTGTCGTCCTCGACAATTCCAACCAACCGAGCGGTTCGTTATTGGCGGCGGCGTTACCGCGCATCGTGGGAACGGCGTCGATGCAGAGTGTAGTCCACGTGGCCTTGTCGGGGTGTGGCGCGGGTGGCGGCGGACCTGGCGTGGGTCCTCCGCCATGCAACAACGGCGCTTCGCCCCTCAGCGGTGGGGACACTCCTCACCGCGACGCGGGACCACCCACGCCCACGCACTCGGAGAACATAGACCCTCATGCACCGCCGCTACATCTGGAGAACGCCCTGCCTCGGAAAATGGAGTGCCTAGGTAGCTATTCCTCAGTGGTGGGGAGCTCCTTACAGCACGCGCCGCCGATGCTCACACCGTCTCTAATAAACTATGTACGCAGCGATCTCACGGGACATGTCACAGGTTGGCCTGCTGACATACTAGAGAAACAGGCTTACAAGTTCACGGAAGAGGCGTATCAGTTAGGTTGTCTTCAATGCACGAGAGTGTCGGCTGAACTCAAGTGTTCTCGGTCAGTGGTGCGACACACAGAGATCCAAGCCACGTTGCAGGAACAGAAAATCATGTACTTGCGGCAACAGATCTCTCGCCTGGAGGAGCTCAAGTCGCAAAACTCGTTTATGTCGGAGGACTAG

Protein sequence:

>DPOGS201371-PA
MVMHARKPQRISDGYFEKHQTHPYQKYNSKRIANDYSTSDNRYTPLRSPNGNVSAHGHGALAGHGGHTGHSGHGHHHYNHMLDDRGYPISRNSYIQKGSDKERDRDYKSCRNKYTDVRSPKEKRNKDSEREKNNYERCESEKKSRTSGMSSSVNRSSKYDKRSAPTSSVPSGGEWSEHISSSGKKYYYNSISEVSQWEKPREWDSRRTSSKDSTYSSRSNREKRSSSRSGRREPEKTNKRSNSTTERYWSSREDDLHERGRTKHSSSQHDGKEGQSLQDMDISPDRSTPLSESSHGAREPPRDGGVPPVVVLDNSNQPSGSLLAAALPRIVGTASMQSVVHVALSGCGAGGGGPGVGPPPCNNGASPLSGGDTPHRDAGPPTPTHSENIDPHAPPLHLENALPRKMECLGSYSSVVGSSLQHAPPMLTPSLINYVRSDLTGHVTGWPADILEKQAYKFTEEAYQLGCLQCTRVSAELKCSRSVVRHTEIQATLQEQKIMYLRQQISRLEELKSQNSFMSED-