Monarch geneset OGS2.0

DPOGS209433
TranscriptDPOGS209433-TA2301 bp
ProteinDPOGS209433-PA766 aa
Genomic positionDPSCF300449 + 57443-64240
RNAseq coverage989x (Rank: top 13%)
Annotation
HeliconiusHMEL0155816e-11044.49% 
BombyxBGIBMGA001802-TA2e-7041.07% 
DrosophilaCG31342-PB5e-2638.67% 
EBI UniRef50UniRef50_E0VYH72e-3141.42%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VYH7_PEDHC
NCBI RefSeqXP_002431171.13e-3241.42%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420214776e-3141.42%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastxgi|1950374893e-3525.30%GH18364 [Drosophila grimshawi]
Group
KEGG pathway 
Orthology groupMCL26703 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209433-TA
ATGTTCCACGGCGGTGACGAGGGGCCGGTCTATTATAGAGAGATACAGAAAAATGCTTACCTCAAGAGGATACCGATCGAGAGCAGCGGGAAATTAAGGCAGCTCGGTAATAAGAGACAGCCATTGAAACCCATGTGGACATTATTCTGCATCCACAACGGTCGAGTTCCGTTCCTAGAGCAATACCCTGAACCGAAGGCTACCCTCACTCATCAGCCCACTTGGCGCGTCTGTTTGAAGACAGCGAGACATGTCACCGCCAGTGTGAAACCTCACCTCGGCGAGGAATACGACTTCCTCGTCGACACAGACCATGGACCCGTCAGGATGCTGGCCCCTAATTGGGACTCGATGCAAGACTGGGTGGCCATACTACGTACGAAACTTCACGAGCTCCGTATAGTGTCCCCCGGTGAGAATGTGTACTGCTCGCCGCCCGCACAGCCTCCCCCGCGAGCGGCGGCGAGGGATCCCACGTCACCACTCCCACCAACACCGGCAATACCACCGGACAGGGTGCCGGGAATCGATCTGACTCCGTTGACGAGGAACCTCGAATCCAACGCCAGCGATGCCGGCAACACGAACAGCACCAGTAACGGTGCAGCGGACGAGGCTGAAGCACGAAACGAATCGGTGACGTCATCAGCGACCAACACCCCGCCCCCGTCTACGCCATCGCTGACGTCACCGCCATCAAATCTGACGTCACTATCAAATATGGCGTCGATGACGTCAGATATCAACGGGGCTGATGTCGATATATCCAATTGGGACACATGCCCGCTGCCAAGCACCAGCCGCGACGAGACGAGGGCGAGCGTGACGAAGATATGCGGACAGAACATCTGCCTCGACGACTCGATACTCAAGAGGAACGTCAGCGAGAGCGACGAGGAGTTCTTCGCCGAGATAGACGAGATACGAGACGACTGCGAGTACAAGCAGCGGCTGGTCGTCAGCGAGGGAGATGGACAGAGTACCGCAAATATAACCAGGAACAACGTGACAGTGATCCAAGTGTCAAATAAGACGCCACAGACAGCGATACCGGTGATGGGAGCGAGCGTCTTCGACTTCGACTTCAAACAGAAGCTGACCATAAAACCGGAAGATGACGATTTCATAAACATCGTAAACACAGAAAACCGGAACAGCTACGGCACTGTCTACGTCGACGATTACAGACACGGCCTCACCACCGTCAGCCTGACGGGCGACGAGTCGGGTGGCGGGGGAAACGTCAAGGTGACAGCCGCCGGGAACGGCTGTGACGCGAACGAGGCCGTCAACGTCAAAGTGACAGACGGATCGAATGTGACGGACGCGACTGTCAATGTGACAGGTGATCCACACGCCAGGGACGTCAACGGCCTGTACGAGCGCCTCTGCATGGCGTCCACGTCCAACAAGGCTGCCTCTCCGTTGCCAGTACGGAGAGTGGTGAACGAGAATAAAACGAGAAAATCCTCGCTACCAAATCTAGACATACCCGAGTCGGCCTACGAATATCTGTACCCGAATAGTTCTAACGCCGACGCCATAGCTAATACTAACGCTGAAAGGATCAATGTCGCGACTGACAGCCCTAATGTTAGAGTGATAAGGTCGAATATAGAGCGGTCGCTGAGTCAGAACGCGTACGACGTCAGTCCGAGGCGAAGGAAACATAACGACAGCCCCAAGACAGATATCAAACAAGAAAAATCGGAACAACAGAAACCGGTGTGGAGAAGGGGCTTGACCGAACTGTCGCTGTTGAGCCGGCTGAGAGGTATCGGTCAAAAGAGACAGGAAGACAGGGTGACGTCATCAGTGAAGGTAGTCCATCGATCAAGAACGCCAGCCAGGGACAACGCCAGGAGAAGAAGCAACTCGCTCAATAACAGTGTGTCACCCCCGAGGCCGTTCCCCCCCCTCCAGCCTCTGTTGTGTCGGCAGGCGGCGGCGCTGCGAGCGGAGCAGGGTCGCGGGGCTAGTGTCACCTCTGTCAGGGTCAAAGACGCCCCGGTTATATGCGAGTATGAGAGGAGCGTTTGGGTCGCTCGCTGGGGTTCGAACGGGTTCCGTATATCCGGTCGCTCTGGTGACCGCATCGCAGGGATAGCCGGTTCTACTCCCAGTTCAGTCACACACGCGAGGAACTTGCTGAGGAACGCCCATACGTCGTTTGGCGGGGAGGACGACATGAACAGAATCTCATACCATGGGACGGAGGTGTCCATACTGATACAACCGTCGGGTCTGGTGAAGAAAATGAGATCAGCTCTGAAAGGAAGGCAGTTGCTAGCGATACGGTGA

Protein sequence:

>DPOGS209433-PA
MFHGGDEGPVYYREIQKNAYLKRIPIESSGKLRQLGNKRQPLKPMWTLFCIHNGRVPFLEQYPEPKATLTHQPTWRVCLKTARHVTASVKPHLGEEYDFLVDTDHGPVRMLAPNWDSMQDWVAILRTKLHELRIVSPGENVYCSPPAQPPPRAAARDPTSPLPPTPAIPPDRVPGIDLTPLTRNLESNASDAGNTNSTSNGAADEAEARNESVTSSATNTPPPSTPSLTSPPSNLTSLSNMASMTSDINGADVDISNWDTCPLPSTSRDETRASVTKICGQNICLDDSILKRNVSESDEEFFAEIDEIRDDCEYKQRLVVSEGDGQSTANITRNNVTVIQVSNKTPQTAIPVMGASVFDFDFKQKLTIKPEDDDFINIVNTENRNSYGTVYVDDYRHGLTTVSLTGDESGGGGNVKVTAAGNGCDANEAVNVKVTDGSNVTDATVNVTGDPHARDVNGLYERLCMASTSNKAASPLPVRRVVNENKTRKSSLPNLDIPESAYEYLYPNSSNADAIANTNAERINVATDSPNVRVIRSNIERSLSQNAYDVSPRRRKHNDSPKTDIKQEKSEQQKPVWRRGLTELSLLSRLRGIGQKRQEDRVTSSVKVVHRSRTPARDNARRRSNSLNNSVSPPRPFPPLQPLLCRQAAALRAEQGRGASVTSVRVKDAPVICEYERSVWVARWGSNGFRISGRSGDRIAGIAGSTPSSVTHARNLLRNAHTSFGGEDDMNRISYHGTEVSILIQPSGLVKKMRSALKGRQLLAIR-