Monarch geneset OGS2.0

DPOGS201692
Transcript: DPOGS201692-TA, 1266 bp
Protein: DPOGS201692-PA, 421 aa
Genomic position: DPSCF300096 - 496910-500087
RNAseq coverage: 113x (Rank: top 59%)
Annotation
Heliconius | HMEL016887 | 1e-118 | 59.27%
Bombyx | BGIBMGA013188-TA | 9e-44 | 31.25%
Drosophila | CG14905-PA | 9e-30 | 27.65%
EBI UniRef50 | UniRef50_D2A6D7 | 3e-31 | 29.94% | Putative uncharacterized protein GLEAN_15005 n=1 Tax=Tribolium castaneum RepID=D2A6D7_TRICA
NCBI RefSeq | XP_321838.3 | 1e-32 | 27.84% | AGAP001309-PA [Anopheles gambiae str. PEST]
NCBI nr blastp | gi|347965700 | 2e-31 | 27.84% | AGAP001309-PA [Anopheles gambiae str. PEST]
NCBI nr blastx | gi|189238476 | 1e-44 | 30.27% | PREDICTED: similar to nuclear lamin L1 alpha, putative [Tribolium castaneum]
Group
KEGG pathway 
Orthology group: MCL25768, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201692-TA
ATGGAGACGCCCGGTGTGGAAGCACAAAATGAATTCGACGTGTTAAAGAAGATGGAAGACGAACATTTAAGATTACAGCGACAAGTAAGAATGACGCAGACCGATAGACTTCATCGTGCTATGGGCGTGCATCCGAAATTTAGACGACAAGATTTACTACTCAGGACACTGAAGAAAGAATATCTGAGTTTACACAAAAACTTAAAGATTGCTAGGTCCGGAGCTCACAAGAACAATGACAAAAAGATGAAGAAATGCTTGAAAAAGTCTCTCATTCGTTACACCAGAACTGGTCAGGAAGCAGAAGAGGGATTGACGCTCATGTCGCAAATCGATGAGTTATTATACAGAGAAAACAAGAAGATCCTCGACTTACACAATACTGTGACTTCGTACACCGGAAAGCTTGAAGAACGTCGCTGTTTAAGCGAAAATAGGCTGTCATCGACTGAGAACAAACTGGAGGCTGCGATGTGTAGATTTAATATGGTTCAGTGTGAGAATATTAGAATACGAGAGGAGATTGAACACATGCTTCAAGACAGGGCGATTTTCAATCAAGCTTGGGACAGGATGCAGAGCGTGCTGATGAGGGGCAAGAAGTTTTTATTTGAGGTGTTCGAGTCGACTGCACTGGCTTACGACCAGCGAGACGAGTGGTGTTCCAAGCTGCGGTCGATGAAGGAGAAGGGAAGGATGGATCAGATGGTCCAAATACAGGAAATGAAAGACTTACAGAAGGCTTTTGATCACGAAATGAAACTGTATCAATTTCTTGCTAGGAAAGGAGTGATAAGAATCAATAAAATAGAAGAAGAACGACAACTAGCCAAGAAAGAGAAAGAAGAAGATGATTACAAAAAGGAATATGAGCGTTATGCTAATACTATTAATGAGATAAATGATCACACACAGGAGTATAATATAAACAAGATAATTGAGACATTTGTGAGAAGGGAACACGATAATTGGTCTCTGTACGAACTGCTCACACAGTATTGTGCGGAAAACGAATTGCTTCGACGAAGTTTGGATGGAATTCACATTGACATAGGTAGAAATGTTATTGCGTTTAGTATCTTTATTTGGAGCTCAACCCACCTCTCCTCTTTTAGTGAATCCATTTTTTTCATAAGACATACAAAGTCTTGTTATTTACTATCGACCAGAATTTGGCTATATAACAAATGTAACGAATATTTAATCATGTCAGTCCTTGAACATCAAATCCACGTTCTAAAACCTAGACTCTTAATCATGTCATAA

Protein sequence:

>DPOGS201692-PA
METPGVEAQNEFDVLKKMEDEHLRLQRQVRMTQTDRLHRAMGVHPKFRRQDLLLRTLKKEYLSLHKNLKIARSGAHKNNDKKMKKCLKKSLIRYTRTGQEAEEGLTLMSQIDELLYRENKKILDLHNTVTSYTGKLEERRCLSENRLSSTENKLEAAMCRFNMVQCENIRIREEIEHMLQDRAIFNQAWDRMQSVLMRGKKFLFEVFESTALAYDQRDEWCSKLRSMKEKGRMDQMVQIQEMKDLQKAFDHEMKLYQFLARKGVIRINKIEEERQLAKKEKEEDDYKKEYERYANTINEINDHTQEYNINKIIETFVRREHDNWSLYELLTQYCAENELLRRSLDGIHIDIGRNVIAFSIFIWSSTHLSSFSESIFFIRHTKSCYLLSTRIWLYNKCNEYLIMSVLEHQIHVLKPRLLIMS-