Monarch geneset OGS2.0

DPOGS203394
Transcript        DPOGS203394-TA   1467 bp
Protein           DPOGS203394-PA   488 aa
Genomic position  DPSCF300003 + 879600-889558
RNAseq coverage   314x (Rank: top 36%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL016625        2e-63    52.26%
Bombyx          BGIBMGA012300-TA  2e-26    41.41%
Drosophila      -                 -        -
EBI UniRef50    UniRef50_E0VW58   4e-14    36.72%    Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VW58_PEDHC
NCBI RefSeq     XP_002430352.1    8e-15    36.72%    conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp  gi|242019811      2e-13    36.72%    conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx  gi|359496904      3e-14    36.76%    PREDICTED: uncharacterized protein LOC100266951 [Vitis vinifera]
Group
KEGG pathway     -
Orthology group  MCL34499  Lepidoptera specific

Nucleotide sequence:

>DPOGS203394-TA
ATGAGCAACAATAACGACCAAAGCTCCCGTCAGCCATCGCTGTCGGGACAGCTCGATGCGTCCCCAGGCAAAAGTGTAAATAACGATAAATCGACACAGATATGTAGGGATTTTATTTGGGGATTGTGTAACAAAGGAGCACAATGTAGGTATCGACATGAACTCGATTTCGAGGCTATGAAAAAAACGCTTAAATTTTGCCACGACTTTCAAAACCCGTCTGGATGTACGCGTGAGCATTGTAATTATCTTCATACCAGTAAAGAGGAGGAAAGCTTATTTTTTGCCACTGGAACTCTACCACGTGTTTTGGCCGAACGCCATGCTAACATAACAGATTCTACGGCTGAAACTATACCACAAATAGCGTTGTTCATTCAAGAATCGCTTAGTGGGGCCCCTCCCCCGCCTCCGCCCTCTACTTTCCCTGTTTCAACAGTGTCCCCGATGGCTCCAAGACCAACAATGCCATTACAACAAATGCCACCGCCTCCTCCTCCTCCACCACCTCCTAATGCATCAACAGTTGCACCAGCCCAGACAAATCGCATATTTGCAGTGCCACCTCCTCCGCCAACACCCTCATTTCCAATAACGCAGCCTCCGCCTCCAATACCAATGTTCGATGCCAGTCGGCCGCCGCCGTCGATACCGGCGGTTGCAATCAACAAAAATGCACCACAAAAACGTCCAGCGTTGAATGATGACACCGTTTCTGTTTGCAAGTTGCGTAAGACGGATGATCTAAAGACAGAGGACGTGCAATGCGACTTGTGCCTCCAGCGAGAGATCAGAATACAGTATTACAGACAGAAGATGGAAAGAATACGAGCGGAAGACGAGTGTCAGTCGCTGGTGTACAAGAAGAAGCTGATGGAATATCAGAAGCTTAAGGATATACTGCGGACGTTGATCGACAGCGACCTGTTTAAACTTGTCGAGGAATCTTTAGGGGATGTACCGCAGTCTTCGTTCGGGGATCCTTTGACTAGTATCATTCCATCATTTCTGTCTGGAATGTTCTCAGCGGGCCCCAAACCGAACCACGACCAGTTCTTGTTGCAATTCATGGAATACATGTTCTCCAAAACGAGGAGCTTCGAGTCGTCGTCGGTGCACGTGGAGGAGTCATTGCGTACACTCTCAACAACGCCGCGTAGAAGTCACCAGTCCCCGGATATATTACAGACGCTAACGGAAATTCTGGGATCATCCTCCGAGCGTTCAGTGTCCGAGGCGAGTGGTGACAGTGCACACGCTTCAGGTACTAACGGCGTGTCGACTTTCCGTCGCTTCAACAGGCCGAGGTTAGATACACTCCGCCCTCCGGCCGCCGCGACGCCGGCGACGCCCGGAGCGCCACAGGCGTACCCGCCGTCGGTGCTGCCGCCGCCGCCTCCGCCACCACCGAGGACAGCCGCCCTCCGTCTGCCAAACCGCCTGCCTCCGAACCCTCGGCGCACATGA

Protein sequence:

>DPOGS203394-PA
MSNNNDQSSRQPSLSGQLDASPGKSVNNDKSTQICRDFIWGLCNKGAQCRYRHELDFEAMKKTLKFCHDFQNPSGCTREHCNYLHTSKEEESLFFATGTLPRVLAERHANITDSTAETIPQIALFIQESLSGAPPPPPPSTFPVSTVSPMAPRPTMPLQQMPPPPPPPPPPNASTVAPAQTNRIFAVPPPPPTPSFPITQPPPPIPMFDASRPPPSIPAVAINKNAPQKRPALNDDTVSVCKLRKTDDLKTEDVQCDLCLQREIRIQYYRQKMERIRAEDECQSLVYKKKLMEYQKLKDILRTLIDSDLFKLVEESLGDVPQSSFGDPLTSIIPSFLSGMFSAGPKPNHDQFLLQFMEYMFSKTRSFESSSVHVEESLRTLSTTPRRSHQSPDILQTLTEILGSSSERSVSEASGDSAHASGTNGVSTFRRFNRPRLDTLRPPAAATPATPGAPQAYPPSVLPPPPPPPPRTAALRLPNRLPPNPRRT-
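
The transcript and protein lengths listed above are consistent with each other if the transcript is read as a complete coding sequence: 1467 bp is 489 codons, that is 488 residues plus one stop codon (shown as "-" at the end of the protein sequence). The sketch below is one way to check this and to re-derive the protein from the nucleotide sequence. It assumes the standard genetic code and that the transcript contains no UTR; the function names are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`, matching the "-" used
    at the end of the protein sequence in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(cds_length_bp):
    """Number of residues encoded by a complete CDS (excluding the stop codon)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # First eight codons of DPOGS203394-TA followed by its stop codon (TGA).
    fragment = "ATGAGCAACAATAACGACCAAAGC" + "TGA"
    print(translate(fragment))            # MSNNNDQS-
    print(expected_protein_length(1467))  # 488
```

To check the whole record, pass the full DPOGS203394-TA sequence to `translate` and compare the result with the DPOGS203394-PA sequence.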