Monarch geneset OGS2.0

DPOGS203824
TranscriptDPOGS203824-TA1605 bp
ProteinDPOGS203824-PA534 aa
Genomic positionDPSCF300010 + 2302858-2308121
RNAseq coverage1563x (Rank: top 8%)
Annotation
HeliconiusHMEL0133431e-9860.32% 
BombyxBGIBMGA003729-TA4e-9160.47% 
DrosophilaCG15747-PA4e-5137.50% 
EBI UniRef50UniRef50_E3WQS92e-5139.75%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WQS9_ANODA
NCBI RefSeqXP_002075261.11e-5941.00%GK17036 [Drosophila willistoni]
NCBI nr blastpgi|1954567283e-5841.00%GK17036 [Drosophila willistoni]
NCBI nr blastxgi|1700512132e-7244.33%conserved hypothetical protein [Culex quinquefasciatus]
Group
KEGG pathway 
InterPro domain[221-342] IPR0186128.6e-38Domain of unknown function DUF2040
[39-169] IPR0070149e-30FUN14
Orthology groupMCL17766 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203824-TA
ATGGCGAAACCAAAAAAAGATGAAACAGCTGAAGAAGCGAAGAAAATCGTAGACGATGCGAAAAATTTTATTGAGAAAGCAATCGCAGATATTGGAAAAACTTCAGCAACAAAGCAACTTATTTTAGGAACTGCGTCAGGATGGCTCACAGGTTTTATGACTATGAAAATAGGAAAGCTTGCTGCAGTCGGTGTTGGAGGCGGCGTAATTTTACTTCATATTGCAAGTCAAAAAGGGTATATTGATATAAATTGGGATAAAATCAACAAAAAAGTTGATAAAATTACTGATAAAATTGAGAAGGAAGCCACAGGAAAATCTCCAGATTGGTTTGAGAAAGTGGAAAGGTTTGTGGACCGTAAAATTGATAAGGCAGAAGAGCTACTTAAGAAAAAGGAGAAAAAAGCAAAGCATTGGTACAACAACTTTGTAGCGGGTGATGAGTATCGTGCTACAGAAACCCATGTGTTTCTGGCATCATTTGTCGCTGGAATGGCCATTGGTATTCTTTGTGGAAAGTATGGTTTAATACTCAAAGACAAAACCAAGGGACAACTATCTTTCCAATCTTCTCGAAATGTGTTTGGGTCAGATTCGGACTCAGGTGATGAGAAACAAAAAGCCCCATTGTCACTGAAGCCAAGCAGTAATATTAATAGACAGGCAAAAATAACTCAAGAAAAAGCCCTAGTAGAAGATCCAACGGTATACCAATATGATGAAATTTATGATTCCATGATTAAAGAAAAGGAAGCTAAAAAAGAGAAAACAACTGTAGATAAGAAACCCAAATATATAGAGAACTTAATAAAATCCTCAAACAAAAGAAAATTAGAAAACGAACGAAGGATTGAACGTCAGATACAAAAAGAAAGAGAAAAGGAAGGTGAGGAATTTGCTGACAAAGAAGTGTTTGTGACATCAGCCTACAAAAAGAAACTTGAAGAAATGAGAATAGAAGAAGAGAAAGAAAAATATGAAGAATATTTAGAGTCCATAGGGGATGTAACAAAACAACAAGATTTAGGTGGATTTTATAGACATTTATATGAACAAAAATTGGGTTCTAAAAAGTCAGTAGAAACTGATAAAAAGAATGATCCACTGAAAGAGGGAAGCCCTGAGCCAAAACATACAGAATTTACGAAGACCAGAAGTAAAGATGACAGAAAAGGTAAAGAAAGCAAACAAAGAAACTATAGAAAAAGGAAAGCCTCACATGAGAAATTGTCTGAAGGTGAGATTGAAGATTCTGAGGGAGAAATAAACCACTGTAACCTCGACGACATCAGAACATTGAAGAAATCAAAACAAGCTTCTGAAAATATTGATGCAGATTCAGACTTCTCAATCGATGATTCCAGCGATGAGGAAAGTAATAAAGAAATTCCACTACCTCAACAAGCAGAAGTTGAAAAATCAGAAAATGATTCTAAAAAGCTAGATGAAAACAAAACACCACCAATAAATAATGAAAAACCTAAAACAGAAATCGATAAACCCAAAGAAAAAATAGACATATGGAAAAAGAGAACCGTCGGGGAGGTCTTTGAACAGGCATTGAAGAGATATTATGAAAGAAAAGCTGCAAGGGGAACTCATTAA

Protein sequence:

>DPOGS203824-PA
MAKPKKDETAEEAKKIVDDAKNFIEKAIADIGKTSATKQLILGTASGWLTGFMTMKIGKLAAVGVGGGVILLHIASQKGYIDINWDKINKKVDKITDKIEKEATGKSPDWFEKVERFVDRKIDKAEELLKKKEKKAKHWYNNFVAGDEYRATETHVFLASFVAGMAIGILCGKYGLILKDKTKGQLSFQSSRNVFGSDSDSGDEKQKAPLSLKPSSNINRQAKITQEKALVEDPTVYQYDEIYDSMIKEKEAKKEKTTVDKKPKYIENLIKSSNKRKLENERRIERQIQKEREKEGEEFADKEVFVTSAYKKKLEEMRIEEEKEKYEEYLESIGDVTKQQDLGGFYRHLYEQKLGSKKSVETDKKNDPLKEGSPEPKHTEFTKTRSKDDRKGKESKQRNYRKRKASHEKLSEGEIEDSEGEINHCNLDDIRTLKKSKQASENIDADSDFSIDDSSDEESNKEIPLPQQAEVEKSENDSKKLDENKTPPINNEKPKTEIDKPKEKIDIWKKRTVGEVFEQALKRYYERKAARGTH-