Monarch geneset OGS2.0

Gene: DPOGS210236
Transcript: DPOGS210236-TA (1224 bp)
Protein: DPOGS210236-PA (407 aa)
Genomic position: DPSCF300196 + 228204-230479
RNAseq coverage: 1155x (Rank: top 11%)
Annotation
Heliconius: HMEL021277, 3e-82, 67.57%
Bombyx: BGIBMGA002552-TA, 2e-71, 74.30%
Drosophila: CG6066-PA, 1e-48, 76.47%
EBI UniRef50: UniRef50_D6X3Z6, 2e-53, 56.02%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X3Z6_TRICA
NCBI RefSeq: XP_969888.1, 4e-54, 56.02%, PREDICTED: similar to CG6066 CG6066-PA [Tribolium castaneum]
NCBI nr blastp: gi|350399468, 8e-54, 60.43%, PREDICTED: UPF0396 protein CG6066-like [Bombus impatiens]
NCBI nr blastx: gi|157129114, 5e-81, 47.16%, hypothetical protein AaeL_AAEL011361 [Aedes aegypti]
Group
KEGG pathway:
InterPro domain: [302-402] IPR009269, 2.5e-50, Protein of unknown function DUF926
Orthology group: MCL11994, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210236-TA
ATGGCTGGAGATCGTGACAAGCACAGGCGCAGTAGTTCAGAAAGACATCGAAGCAGAAGTCAAAGTCGAGATCGACGTAATAGTCGCGATCGTGAGAAACGGAACCGCAGTAAGGATCGGTCCCGGAAGAAGAGCAGGGAGCGTCATGGGAATAACCGGGATAGCAGAGAAAGACGCGGTAGCAAAGATAGACGAGATAGCAGAGAAAGACGAGATAGCAGAGAAAGAGATAGACGAGATAGCAGAGATAGACGTAACAGCAGGGATAGATATAGACACAACAGTAGAGATAAGCGTAGAGGGAGCAGCGAGAGAAGTAAGAGCAGAGATCGAATGAAAGATAAACATAAGAGTCGAGATGGTGGTAGCAGACACACAAGTAGTACCAGAAATATACATCGAGACCGAGGTCGCAGTTCATACAGAGAATTCAGTGGCGGGGGATTAGGAGGAGGACTCGGTGGAGGTAGCCTTGCACCTCGTAGTCGCTTCAAGCCTCAGGAGGAAGAGTTCCTTGATGCTCGGAGAGAAGAGAGAGAAAGAATTGGAGAAATTGGAGTTTCATCGGTATGGGGCAAGTCACCCATAAGAGATGATTCAGACGAGGAACCTCCAGATTTGCCATCTCATAATGGCAAAGAGAAGAAAAAGAGTAAGAAATCAAAAGAAAAAGACACAAAACAGAAGTTAAAGAAGTTGAAGAAGAAACTCAAGAAGGCAAAAAAGGCTCGTAAGAAAGCAAAGAAGAAATCCAAGAGTTCCAGCAGCGACAGTTCTAGTAGTGAGGAGGAAGTGTGGGTGGAGAAAGGAAAAGAGGATGAAGTCGCAAACAAATCGTCGAAGACAGATGAGTTAGCATCAGAAGCGTTCGGTCCGCTACCGCGAGTGGGCCCGGCGCTGGGGCACAAAGACTTCGGACGAGCGCTGCTTCCTGGAGAAGGAGCGGCCATGGCGGCCTATGTCGCGGAGGGGAAACGAATCCCCCGGAGAGGAGAGATAGGACTCACCTCCGACGAGATCGCCTCATACGAAGCTGTGGGATACGTCATGAGCGGTAGCAGACATCGTCGTATGGAGGCTGTTCGTATCCGTAAGGAGAACCAGATCTACAGCGCGGACGAGAAGCGCGCGCTGGCCGCCTTCAGCAAGGAGGAGCGCTCGAGGAGGGAGGCCGCCATATTGTCACAGTTCAGAGACGTGCTCAGCGCCCGGGCCCGCGAGTGA

Protein sequence:

>DPOGS210236-PA
MAGDRDKHRRSSSERHRSRSQSRDRRNSRDREKRNRSKDRSRKKSRERHGNNRDSRERRGSKDRRDSRERRDSRERDRRDSRDRRNSRDRYRHNSRDKRRGSSERSKSRDRMKDKHKSRDGGSRHTSSTRNIHRDRGRSSYREFSGGGLGGGLGGGSLAPRSRFKPQEEEFLDARREERERIGEIGVSSVWGKSPIRDDSDEEPPDLPSHNGKEKKKSKKSKEKDTKQKLKKLKKKLKKAKKARKKAKKKSKSSSSDSSSSEEEVWVEKGKEDEVANKSSKTDELASEAFGPLPRVGPALGHKDFGRALLPGEGAAMAAYVAEGKRIPRRGEIGLTSDEIASYEAVGYVMSGSRHRRMEAVRIRKENQIYSADEKRALAAFSKEERSRREAAILSQFRDVLSARARE-
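
The transcript and protein records above can be cross-checked against each other: 1224 bp corresponds to 407 codons plus one stop codon, and translating the nucleotide sequence should reproduce the protein sequence. The following is a minimal sketch of that check, assuming the standard genetic code and the record's convention of writing the stop codon as "-". The function name `translate` and the short test fragments are illustrative and not part of the original record.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(nucleotides, stop_symbol="-"):
    """Translate a coding sequence in frame 1 (hypothetical helper).

    Stop codons are written as `stop_symbol` to match the record above;
    codons with unknown bases become "X"; a trailing partial codon is ignored.
    """
    seq = nucleotides.upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        protein.append(stop_symbol if residue == "*" else residue)
    return "".join(protein)


# Length check using the figures stated in the record:
# 407 residues plus one stop codon should account for all 1224 bp.
transcript_bp = 1224
protein_aa = 407
lengths_agree = (protein_aa + 1) * 3 == transcript_bp

# First six codons and last four codons of DPOGS210236-TA.
start_fragment = translate("ATGGCTGGAGATCGTGAC")
end_fragment = translate("GCCCGCGAGTGA")
```

To check the full record, pass the complete DPOGS210236-TA sequence to `translate` and compare the result with the DPOGS210236-PA sequence, including the trailing "-".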