Monarch geneset OGS2.0

DPOGS216019
Transcript: DPOGS216019-TA (1404 bp)
Protein: DPOGS216019-PA (467 aa)
Genomic position: DPSCF300078 + 809189-821540
RNAseq coverage: 340x (Rank: top 34%)
Annotation
Heliconius: (no hit listed)
Bombyx: BGIBMGA001090-TA, E-value 6e-47, identity 72.00%
Drosophila: CG9646-PA, E-value 1e-82, identity 54.42%
EBI UniRef50: UniRef50_D2A032, E-value 2e-125, identity 48.32%, Putative uncharacterized protein GLEAN_07350 n=1 Tax=Tribolium castaneum RepID=D2A032_TRICA
NCBI RefSeq: XP_975467.2, E-value 4e-126, identity 48.32%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp: gi|189236528, E-value 9e-125, identity 48.32%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx: gi|189236528, E-value 7e-120, identity 48.69%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
KEGG pathway: (none listed)
InterPro domain: [1-119] IPR019141, E-value 6.7e-50, Protein of unknown function DUF2045
Orthology group: MCL14326 (Single-copy universal gene)

Nucleotide sequence:

>DPOGS216019-TA
ATGGACACGAAGGGGGAAGGCGAAGAGATGACTTACCCTCACATCTGCTTCATGGTGGACAACTTCGATGAAGTGTTCTGCGACATCGTTGTCCGAGATGGTGAGATGGTATGCGTGGAGCTGGTGGCTCGTGGACGGGGGGGAGCCGCTCAGGCAGTCATCTTCCTTGGATCGATAAGATACGACGCTCTCACCAGGGTCTACGATGCCCGGCAGTCTTCGTTGTCTACGAAGGTTGCTCAGCGTATGTCTTTCGGTTTGCTGGGACGGGGAGCCGGCGCCGCGGCCAGGTGCGAGTTCGTGCGTATGAAAGGACCAGGGGGGAAGGGACACGCTGAGGTGGCTGTATCTCGAGCTCGTTCATCAGGAGTACACACTCCGTGCTCAGAGCCAGGGTTAGGTCCGGAGCTGTGGGACAGTGACTTCGAAGACGACCCCGACGAGTTGTTCCTATACAGGCATCAGCGTCGCCTGAGCGACCCCAGCGCCAACCTGAATCACTTCGTGCGTGGCGGATGGCGTGCGAGGGATAAAGACGACACGAGCCGATCTGAGAACGAGGGTCTGGATGCTCTGGCTGACGGACTCGCTGAGGTGGAAGCCGGGGACTTGAGAGACGGGCGCTCGGGTCGCGGGCGGCGGGCTTCCTCGGTGCGGTGGAGGACGTGCTGCGCGGGCGCCGGCGGGGGAGGAGGGGGCGGCTCGCGGCGGGAGGGAGACATGAGGGACGTGTACCGTCGAGCCCCGCCGGACCTGTCCCCCGGGTGTCTGCACACCGTGTCCCCTCGCCGCAGGCCCGTCAGACCCGCGCCTCAGCCGCCGCCGCTGAGGCCGCTACCAGCTCGCGTCTCGCGACCTGCTCCCCCCGACACAGAGTGCGAGCTGGCCTGCGACGCCAGCTGCCTGGATGGACGATCGTCGCCCGCACCAGCCCCCGCCGCCGCGCGCCGTCCGCCCGCCTGCATGGCGCCTCTAACGGTTCGTGGAGAAGAGGAGCTGCCGAGCCAAGCCCAGGGAGAGGGCGAGGCCCGCGCGGACGGGGAGCGGTCGGGTGGCGCGTGGAGCGCGGGGGCGCGGGCCTACGTGACGCTGCCCCGGCGGCGCGGCGTGGCGGCTGTGCTCTTCCGTAACGCGCCACTACCTCCTCGACGGACCACGCCCGACGGAACAGATATCTACTACTGGTGTGACGTGCCAAGACGCAAGCTCACCGAGTTGGACGACGGAGCTTACAACCCTTTATGGGCGATGCGCGGCTTCACTCAGACCTTCCATCTGTGGAAGGAGGGCCGGAGACAGCAGTCCGCGCCGCTCTGCGCGTTCCTCACATACGTCACACTGCCCTGGTGGAGTATAGCCAAAGATATTCTGGACCACCGCGAGGAACCCATCCTGACCTTCTGA

Protein sequence:

>DPOGS216019-PA
MDTKGEGEEMTYPHICFMVDNFDEVFCDIVVRDGEMVCVELVARGRGGAAQAVIFLGSIRYDALTRVYDARQSSLSTKVAQRMSFGLLGRGAGAAARCEFVRMKGPGGKGHAEVAVSRARSSGVHTPCSEPGLGPELWDSDFEDDPDELFLYRHQRRLSDPSANLNHFVRGGWRARDKDDTSRSENEGLDALADGLAEVEAGDLRDGRSGRGRRASSVRWRTCCAGAGGGGGGGSRREGDMRDVYRRAPPDLSPGCLHTVSPRRRPVRPAPQPPPLRPLPARVSRPAPPDTECELACDASCLDGRSSPAPAPAAARRPPACMAPLTVRGEEELPSQAQGEGEARADGERSGGAWSAGARAYVTLPRRRGVAAVLFRNAPLPPRRTTPDGTDIYYWCDVPRRKLTELDDGAYNPLWAMRGFTQTFHLWKEGRRQQSAPLCAFLTYVTLPWWSIAKDILDHREEPILTF-
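
The transcript and protein lengths in this record are consistent with ordinary translation: 467 residues plus one stop codon give 3 x 468 = 1404 bp. The following is a minimal Python sketch of that relationship, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon (neither is stated in the record). The function name `translate` and its `stop_symbol` parameter are hypothetical, introduced only for illustration.

```python
from itertools import product

# Standard genetic code, built in TCAG order (an assumption: the record
# does not say which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" mirrors the way the
    protein sequence in this record appears to mark its terminal stop.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.replace("*", stop_symbol)


# Length check using the figures listed in the record.
assert 3 * (467 + 1) == 1404

# First seven and last four codons of DPOGS216019-TA, copied from the record.
assert translate("ATGGACACGAAGGGGGAAGGC") == "MDTKGEG"
assert translate("CTGACCTTCTGA") == "LTF-"
```

Passing the full DPOGS216019-TA sequence to `translate` should reproduce the DPOGS216019-PA sequence if the record is internally consistent; only the two ends are checked here.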