Monarch geneset OGS2.0

DPOGS215079
Transcript: DPOGS215079-TA, 1854 bp
Protein: DPOGS215079-PA, 617 aa
Genomic position: DPSCF300187 - 185266-192701
RNAseq coverage: 567x (Rank: top 22%)
Annotation
Heliconius       HMEL008698         1e-168   96.27%
Bombyx           BGIBMGA007178-TA   1e-154   89.23%
Drosophila       CG33691-PB         1e-112   59.88%
EBI UniRef50     UniRef50_E2B3Z8    9e-146   42.91%   Putative uncharacterized protein n=11 Tax=Formicidae RepID=E2B3Z8_HARSA
NCBI RefSeq      XP_393109.1        9e-147   42.98%   PREDICTED: similar to CG33691-PB, isoform B [Apis mellifera]
NCBI nr blastp   gi|307214635       3e-145   42.91%   hypothetical protein EAI_04969 [Harpegnathos saltator]
NCBI nr blastx   gi|157108151       4e-149   47.03%   hypothetical protein AaeL_AAEL004946 [Aedes aegypti]
Group
KEGG pathway:
Orthology group: MCL15847, Insect specific

Nucleotide sequence:

>DPOGS215079-TA
ATGGTGGAGGCCAATAAGGCGAAAGCTTTGGCCGCGTCCGTGGCGGCCGCGCGACCGCAACCCGGCGCGCTGCTGGGAGCTTCCGCACCGGCTGTAGGCGGTCAAATGGGTATCTTGAATTTTCTTACCAGAAAACCGGGAACGACGCAGCCGGCCGCAGAACAGCGGCCAGCGGCTGATGATAAGAAGACCCCGGCTCCAGACGAAGCCAGCTGGAAGGGTCACTTCGGTTGGGACATGCTCGGTAAATGTCACATTCCCTACATATACCGGTCGGGGGAGAAGTATGTGGCCGTTCGTATGGTAGAGATCAAACTCCTCAACAAATATCTTAATTACCTCCACGCAGACATATACTCGTGCACTTGTATAAGAAGTTACTACATCACAGATATAGAAGCGAGATTACTCAATGAGATCAACAATAGGCATTGTGACGGCCAGTTCGGTCGTGAGCCGTTCACCCAGAAGGATCTGGTGGTGCGGTTGTCAGATGCATATGAGTTCTATAACTTCCTTGACGTGTGTTACAACAAGCTTCTAAGAGGCACCACCAATAACAAGGATAAGTGCGGCTTCATTAGGATAAACAAGGAATCTGTTGTGCCTTACACGGTCAGGGATAGTCAGAAGTTTGTACCGTTATTCTATTTTGAAGGAGAGACTGACAATTTAAAGTTAAAGGCCGACCAGTTGAAAGGTTGGGACTTATCATACTTGAAGTTCTGTTGTAAAGTGCAAGGTATACGGAATGAACTGTTCGCGAGTGAGACATGTTCAGTGATCAGTTTGACGGACATCAAAAGTTACTTCCCCCCGGGCACCGAGTTTGAGGAGTACTGGCCTAACAAAGTCGTCGATTCACAGCTACTGATATCCGCTAAAGGTTCAGTGTCTGGTGGCGGTCAGTGGACGCGAGCGCCCCCAGCGCCTCCTCCGGCTGTCGGCGTGGGTAGCAGCGTGTCCCTGACGAGTGTACCTCGGGGGCGGCGCCCTACCCAGCGTCACTCACCAGCCTCGGCAGCTATGCAGATGCCGCATGCTGGGCTCTCAGCGGCCGCTGTGCAAGCCCTCGCCAACGGTTGGAGTCTCCCTGGGTCTCTGACGCAGGCACAAACACAGCAAGTCCTGCGGCTAGCACAGGCACAAGTTGCTGCGCAGGCGCAGGCAGCGGCGCGTTACAACAACGCTGCTATAGCAGCAGCGGCTGCGGCACTGCCGCAGCACAGGTCGCAACACGCCAGGAACGTGCAGTTTCCGAACAGTGCGATAACAATGGCGATGAGCCAACAGCCGCCGCCGCCGCTGGTGAGGAGCTCCGCCAACACCAACACTATGAACGCCCCACCCCAAGTGACTGGTTCCATGAATGGTCACACTACCTCTCACTCAGTGGACACTCGCAAACGACTCACGCCCATACCCGAGATCAGTATAAGTGGAAACCATACGCCGTATAAGGTCCAAAAAGCATTGGTAGAGAACACAATGGTTCCGTGTATCAATGCTAAACCCTACCAGTACACTGACCTCCTGATGACCCTACCTGACCTCGCGAGCCATTTCTTTCCACGGGTGTCTCTCACAAACTGCAGGGCCATGTTGGATGCATTGGAACTAACACTTTATAGGCCTAATTCCACTCAACTACAAGTACTACGCAACTCTGGCAAGTGTAAAACTGCTGCTGCCGGCGAGAACAGCATGGCGTTGGTTCAGATCCGTGACGTCATGCAGCACATGCCGCAGATAAAGTACATGCTGCGGTCAGGACTGGCCAATGATGAGCCGCTGCAACCACCTAGGCCGGCCCACGCTCAGCCCAGCCACGCTAAGAGGGCACGGGCCAACTAA

Protein sequence:

>DPOGS215079-PA
MVEANKAKALAASVAAARPQPGALLGASAPAVGGQMGILNFLTRKPGTTQPAAEQRPAADDKKTPAPDEASWKGHFGWDMLGKCHIPYIYRSGEKYVAVRMVEIKLLNKYLNYLHADIYSCTCIRSYYITDIEARLLNEINNRHCDGQFGREPFTQKDLVVRLSDAYEFYNFLDVCYNKLLRGTTNNKDKCGFIRINKESVVPYTVRDSQKFVPLFYFEGETDNLKLKADQLKGWDLSYLKFCCKVQGIRNELFASETCSVISLTDIKSYFPPGTEFEEYWPNKVVDSQLLISAKGSVSGGGQWTRAPPAPPPAVGVGSSVSLTSVPRGRRPTQRHSPASAAMQMPHAGLSAAAVQALANGWSLPGSLTQAQTQQVLRLAQAQVAAQAQAAARYNNAAIAAAAAALPQHRSQHARNVQFPNSAITMAMSQQPPPPLVRSSANTNTMNAPPQVTGSMNGHTTSHSVDTRKRLTPIPEISISGNHTPYKVQKALVENTMVPCINAKPYQYTDLLMTLPDLASHFFPRVSLTNCRAMLDALELTLYRPNSTQLQVLRNSGKCKTAAAGENSMALVQIRDVMQHMPQIKYMLRSGLANDEPLQPPRPAHAQPSHAKRARAN-
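
Length check:

The transcript and protein lengths reported for this record (1854 bp and 617 aa) can be cross-checked against each other. The sketch below is illustrative and not part of the database. It assumes the transcript is a complete coding sequence with one stop codon, and that the trailing "-" on the protein marks that stop. The helper names parse_fasta and lengths_consistent are hypothetical.

```python
def parse_fasta(text):
    """Return {header: sequence} for FASTA-formatted text."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Start a new record; drop the leading '>'.
            header = line[1:]
            records[header] = ""
        elif header is not None:
            # Sequence lines may be wrapped, so concatenate them.
            records[header] += line
    return records


def lengths_consistent(transcript, protein):
    """True if the transcript has one codon per residue plus one stop codon.

    Assumption: a trailing '-' or '*' on the protein marks the stop
    and is not counted as a residue.
    """
    residues = protein.rstrip("-*")
    return len(transcript) == 3 * (len(residues) + 1)


# Lengths as reported in the record: 1854 bp and 617 aa.
print(1854 == 3 * (617 + 1))  # prints True
```

Passing the two sequences above through parse_fasta and then lengths_consistent applies the same check to the actual residues rather than to the reported lengths.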