Monarch geneset OGS2.0

DPOGS210829
Transcript: DPOGS210829-TA; 2115 bp
Protein: DPOGS210829-PA; 704 aa
Genomic position: DPSCF300027 - 231385-233795
RNAseq coverage: 101x (Rank: top 61%)
Annotation
Heliconius: HMEL017684; 1e-126; 42.48%
Bombyx: BGIBMGA014395-TA; 2e-125; 42.61%
Drosophila:
EBI UniRef50: UniRef50_UPI000192758B; 3e-77; 36.59%; UPI000192758B related cluster n=2 Tax=unknown RepID=UPI000192758B
NCBI RefSeq: XP_002166180.1; 1e-154; 43.21%; PREDICTED: similar to predicted protein [Hydra magnipapillata]
NCBI nr blastp: gi|221125676; 3e-153; 43.21%; PREDICTED: similar to predicted protein [Hydra magnipapillata]
NCBI nr blastx: gi|221125676; 7e-149; 43.27%; PREDICTED: similar to predicted protein [Hydra magnipapillata]
Group
KEGG pathway:
Orthology group: MCL18525; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210829-TA
ATGAAACGTGGTCGAGCGGATTCAGTCACATGTTTGCGGACAAGTGATATGGCAAACCAAATGCGTAAAGATTTCATAAAATTATCTGAAATGGGTGAAGAGGAGATCTGGCAATTGCTAGATAATATTCCTACTGATGATGAAGGGACAGACGATGACAACGATGACGACGTTGATAGCGATCAAGGCGCCCCAAATCTTGATTTCATGCATACTGAAGATGAATTGGAACCACTCACTGACGAATCTCAGAAAACGGAAATACCTACTAATACAACAGAATCGGGCAGTGCAGTGTCAACAATTTGTCCGATTGTTCCAGAGCAACTTGATCAAAATGAAGAAATAATTTCGCTTATGCCTGACCACACAGAAGAAATTGCTGTACAGGAATCCGCGAAACCCTATCGACGCCGCAAACGTCCTCGTACCCCGGAGCCAACAGAAGAAGAAGACGGTCCCGTTGTTCAAGCTCTCGGTGTAGTGGATGATGTTGCAGACATGAAAAATGATTCTCCACAATTCAAATCTATTGTTTGGAAAAAAAAGAATCTTCACCTTCATGTAAATGAAGTGGTTTTTAGAGGTCAAAAAGAATTACCGGAGACGATTACCAGATTGGACACACCTTATAAATGTTTCCGTTACTTTATGAACGACGCCCTGTTTGACCATCTTGTTGAACAATCAAATTTGTATGCAAGGCAAAAGAACATAAGAACAAACTTCAGTGTCCAATCCGTTGATTTGCGAAAATTTGTTGGCATTCTATTGTATATGTCGGTTTATCGCTATCCAAATGTGCGATCATATTGGGGAAATAATTCCTTTGAGGCAATTCGCCAAACAATGCCCGTTTTGCGATTTGAAGCAATACGCCGGTACCTCCATTATAACGACAATGCAGCAGTAGTTACACGAGGTGACCCAGGATATGACCGTCTTTATAAAGTTCGTCCCTTGGTAAAACATTTCAATGAAAGATTTCTATCAGTGCCCATGCCTTCTAGGCTGTGCGTAGATGAACAAATGTGTGCCACAAAAATGACGGGATCCCATTTGCGCCAATATATGCCCAATAAGCCACATAAGTGGGGCTTCAAATTTTTTTGTCTTTGTGATACTTCCGGATTTTCGTACTCTTTCGAAGTATACACTGGTGCCGGAGATAACGTGATTTTTGATGGTATGCCAGATCTTGGGGCTGCGTCAAATGTTGTAGTTCGCTTGTCAAAACAAATACCAAATTTCGTAAATCACATCCTATACTTCGATAATTTCTACACGTCCCTTGGCCTGCTTACGTATCTCCGAAGTAGAGGAATTTACAGTTTGGGAACTGTGCGAGTAAACAGAGTACCCAACTGTAAATTGTCTAGCGATGCAATTTTGCAACAGAAAAAGGTTGATCGTGGTTACTCAGAAGAGTTTGTAGGTACTGCATATGGTATTGATATATCCTCTGTGCTATGGAATGATACGAAAACTGTGCGCCTATTGTCTACCTACGTTGGAGTAAAACCATTTGCGTCTAAAAACATAAACAAACAGATTTCAAAAGTAACACGTTGGGATAGAAAAAAGAAAACCCACTATGACATTGACTGTCCACAAATCATCAAAGAATATAATCGGCATATGGGGGGTGTCGATTTGATGGATGGCTTATTAGGCCGTTATCATATTCGTATGAAAACCCGGAAATGGACCAACCGAATTTTTTATCATATGGTCGACGTGGCAATGGTGAATGCTTATATACTTTATCATCGGTTGCATCCCCATGCAGATAAAATTGAGTTGCCAACGTTCAGAACACAAGTCGCAGAATCACTCTGCGTGTGCGGCACTATTCCAGTAAAACGAAGCGTTGGCCGACCATCCAATACGACACCGCCACCAAAGATACCAACAGCGAAACGAGCCTATCTGCCAACCGATGATATTCGTTATGACCAAATTGGCCACTGGTGCGTTTTTAGGGATCGGTCTGGCAAGAA
GCAGTGCAAATACCCTAAATGTAAATCGGAAACTCAAGCATACTGCACTAAATGCAATCTATCTTTGTGCAGTTCAACAACAAAGACATGCTTTTATGATTTTCATAACAAATAG

Protein sequence:

>DPOGS210829-PA
MKRGRADSVTCLRTSDMANQMRKDFIKLSEMGEEEIWQLLDNIPTDDEGTDDDNDDDVDSDQGAPNLDFMHTEDELEPLTDESQKTEIPTNTTESGSAVSTICPIVPEQLDQNEEIISLMPDHTEEIAVQESAKPYRRRKRPRTPEPTEEEDGPVVQALGVVDDVADMKNDSPQFKSIVWKKKNLHLHVNEVVFRGQKELPETITRLDTPYKCFRYFMNDALFDHLVEQSNLYARQKNIRTNFSVQSVDLRKFVGILLYMSVYRYPNVRSYWGNNSFEAIRQTMPVLRFEAIRRYLHYNDNAAVVTRGDPGYDRLYKVRPLVKHFNERFLSVPMPSRLCVDEQMCATKMTGSHLRQYMPNKPHKWGFKFFCLCDTSGFSYSFEVYTGAGDNVIFDGMPDLGAASNVVVRLSKQIPNFVNHILYFDNFYTSLGLLTYLRSRGIYSLGTVRVNRVPNCKLSSDAILQQKKVDRGYSEEFVGTAYGIDISSVLWNDTKTVRLLSTYVGVKPFASKNINKQISKVTRWDRKKKTHYDIDCPQIIKEYNRHMGGVDLMDGLLGRYHIRMKTRKWTNRIFYHMVDVAMVNAYILYHRLHPHADKIELPTFRTQVAESLCVCGTIPVKRSVGRPSNTTPPPKIPTAKRAYLPTDDIRYDQIGHWCVFRDRSGKKQCKYPKCKSETQAYCTKCNLSLCSSTTKTCFYDFHNK-