Monarch geneset OGS2.0

DPOGS202898
Transcript: DPOGS202898-TA, 1695 bp
Protein: DPOGS202898-PA, 564 aa
Genomic position: DPSCF300126 - 121447-127280
RNAseq coverage: 31x (Rank: top 75%)
Annotation (source: hit, E-value, identity, description)
Heliconius: HMEL015673, 6e-96, 34.20%
Bombyx: BGIBMGA004171-TA, 2e-151, 49.65%
Drosophila: CG18130-PA, 2e-16, 22.43%
EBI UniRef50: UniRef50_E0VDV7, 2e-14, 30.46%, Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VDV7_PEDHC
NCBI RefSeq: XP_972627.2, 8e-17, 24.10%, PREDICTED: similar to CG14221 CG14221-PA [Tribolium castaneum]
NCBI nr blastp: gi|189237510, 2e-15, 24.10%, PREDICTED: similar to CG14221 CG14221-PA [Tribolium castaneum]
NCBI nr blastx: gi|195448617, 9e-33, 23.83%, GK24984 [Drosophila willistoni]
Group
KEGG pathway 
InterPro domain: [9-127] IPR012336, 4.6e-10, Thioredoxin-like fold
Orthology group: MCL30231, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202898-TA
ATGGCTAAAAAGAAAATTGAATTGTTTATTAATATTGACAACGAAAAAGAGTTTGAGAATGTTATAACAACTAATTCGAAAACATTGATTTGTGCTGATTTGTATAATTCTTACGCAGGACCCTGCGTTGTCCTAGATCACCTCTTTGTAAGAATTAAATTGGACTGGAGTGATAATAAGATTGTTCTTTTGAGGATTCGAGTAGACGATATATCATCACTGAGACGGTTTAAGAACCGTTGCGAACCTGTCTTTATATTTATTTTGGAAAAAAGAATTACAAAAATATTTCGCGGCGTAGACAATACAAAGTTCGCAGAGGTCGCTAAAAAGGAGGTTGAATATTTTAAAGCGAGACTGGAAGGAGTCATAGATGACAGAAAAACTTATAATTTCGATGAAGTACTACCAGAAGAACAGGAATGGGTGAAACAGCTTGTTTTGGAGGAAAAGCAAGGAAAAGCTATCGTTTTGATCAGGAAGAAGGAAAGGCAGGCTGCTCGCAAGCGTCACCGGGCCGAGCTGATGGTTCCTTACTTGCAGCACCTTAATTTCGTGCTGTACTGGCCTCATGCTGTTCACGCCCATCCGGAGCTATACGAACGATGGGATGTAAACAATATAATAATGGTTGGCCGCGAAGAAATTCAGCTCACTAAAGAAATGGCCGATGACATTTTATATGCTGGCGATGCTCCCATCAATGAAGCGTCTATGCATGTGTTGCTGTCAGGACCAGCTTTGGCTATTTGTTTCAGACTGCTTGACACAGACAAACATTTCGTCTCCCTTGTTCGTAAAATACTGTATGAAGATCTCACACCGGTGGACGAAAGTAAGCCTATTGAAATGCAACTACCACAGAAAACGGCGTACGATTACTACAAGTCGTACAGCTTAACGAAGGAAGAGATACAAATGAAACGTCGTGAAGAAATTATCAAAAGAAAGGAAGAGGAAAAGGAAAAACGTGCAAGAAGGCTATCGGAGATGCAGCGGCTAGCGAGACAAGCTATTGAAGATACCATTGAAGCTAAGAGGGCGGAAAAAGAAAAAAGGAAACTGGAGCTGCTCAAAGCTGGGAATTTATCAGCTCTGGAGACATTGAAAGAGGAACCAGATGACGAAGAAGTGGACATCACTATACCGGAGGAGCTTTCTGAGGAGGAAGAAGAGAGTTCAGAGGAAGAATGTGCGGATGAATATTTGCCGCCAGCTGGTCTTGTCATACCAGGGTTTTATGCACCACCAAACGACATATCCAAAGCTAATGGATTGGCTATACTGTTCCCTAATCTGGTATTGGAGAATGTAACTCCCGAGTTGGAGTTTCTTCCGCCACACGTTCTGGTTATGCTCGACATAACAAAACGGTATAAGGCCATAGAGGCGATGTCAAAGTATAGACGCGAGATAATACATATTGGTATTTTTAAAGCTACTAATCCTCATGATGGTGAGCACGTAGCTTTCAGTGTGAAACAGTTCGACAAAATTGATCAAAAATACGACGAGGATCTCGTCAAATTGGTTTTCATGGTTTCTATAAAAAGCGACCTGGCGCTGCTGAGTCTTGTGGACCTGGGTCCGTATTATGTCAGCGCGGATGACACATCAGGAGAACTGGAATGCGCCGCCATGTTTGATGTGCACTACGCCGATAATTATAATGAATTCGAAGATTTCTCACATTAA

Protein sequence:

>DPOGS202898-PA
MAKKKIELFINIDNEKEFENVITTNSKTLICADLYNSYAGPCVVLDHLFVRIKLDWSDNKIVLLRIRVDDISSLRRFKNRCEPVFIFILEKRITKIFRGVDNTKFAEVAKKEVEYFKARLEGVIDDRKTYNFDEVLPEEQEWVKQLVLEEKQGKAIVLIRKKERQAARKRHRAELMVPYLQHLNFVLYWPHAVHAHPELYERWDVNNIIMVGREEIQLTKEMADDILYAGDAPINEASMHVLLSGPALAICFRLLDTDKHFVSLVRKILYEDLTPVDESKPIEMQLPQKTAYDYYKSYSLTKEEIQMKRREEIIKRKEEEKEKRARRLSEMQRLARQAIEDTIEAKRAEKEKRKLELLKAGNLSALETLKEEPDDEEVDITIPEELSEEEEESSEEECADEYLPPAGLVIPGFYAPPNDISKANGLAILFPNLVLENVTPELEFLPPHVLVMLDITKRYKAIEAMSKYRREIIHIGIFKATNPHDGEHVAFSVKQFDKIDQKYDEDLVKLVFMVSIKSDLALLSLVDLGPYYVSADDTSGELECAAMFDVHYADNYNEFEDFSH-
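
The two FASTA records above can be checked against the lengths listed at the top of the record. The following is a minimal Python sketch, not part of the original geneset tooling: the function names `read_fasta` and `cds_matches_protein` are hypothetical, and it assumes that the trailing `-` on the protein sequence marks the stop codon and that the transcript consists of coding sequence only.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a {record_id: sequence} dict."""
    records = {}
    current_id = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # The record ID is the first whitespace-delimited token after '>'
            current_id = line[1:].split()[0]
            records[current_id] = []
        elif current_id is not None:
            records[current_id].append(line)
    return {rid: "".join(parts) for rid, parts in records.items()}


def cds_matches_protein(cds, protein, stop_marker="-"):
    """Return True if the CDS length equals 3 * (residues + one stop codon).

    Assumption: a trailing stop_marker on the protein stands for the stop
    codon and is not counted as a residue.
    """
    residues = protein.rstrip(stop_marker)
    return len(cds) == 3 * (len(residues) + 1)
```

With the lengths listed for this record, the relation holds arithmetically: 3 × (564 + 1) = 1695. To run the check on the sequences themselves, pass the text of both records to `read_fasta` and compare the `DPOGS202898-TA` and `DPOGS202898-PA` entries with `cds_matches_protein`.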