Monarch geneset OGS2.0

DPOGS200679
Transcript: DPOGS200679-TA, 1317 bp
Protein: DPOGS200679-PA, 438 aa
Genomic position: DPSCF300353 - 146714-149183
RNAseq coverage: 35x (Rank: top 74%)
Annotation
Heliconius: HMEL017788, 2e-102, 47.39%
Bombyx: BGIBMGA008913-TA, 8e-39, 53.33%
Drosophila: CG3227-PA, 5e-24, 46.85%
EBI UniRef50: UniRef50_D6WUW0, 5e-26, 45.97%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WUW0_TRICA
NCBI RefSeq: XP_002077900.1, 9e-24, 42.55%, GD22821 [Drosophila simulans]
NCBI nr blastp: gi|270011337, 2e-25, 45.97%, hypothetical protein TcasGA2_TC005343 [Tribolium castaneum]
NCBI nr blastx: gi|270011337, 1e-24, 45.97%, hypothetical protein TcasGA2_TC005343 [Tribolium castaneum]
Group:
KEGG pathway:
InterPro domain:
[307-424] IPR018380, 4.5e-36, Uncharacterised protein family CpipJ
[332-406] IPR018379, 1.2e-16, BEN domain
Orthology group: MCL25551 Lepidoptera specific

Nucleotide sequence:

>DPOGS200679-TA
ATGGATCCTAAATTATGGTTGCTTGTTGAGTATGTCGAAGATCAGAGTGCGTTTTCTAATTATGGCGTTATAAATACTAATAATCTCATCCAACACGAGACCGATCTTCACAATGGGAAAGTTGTTTTCGTCAGAGGCAAAAGCAATGGAGCACGCAAGGCGCAAATACTTAGGATATCTGACAGCAAACGATACGTAAAAGATCTCAAAATAATGTTGGAGAGACAAGACAATCAAGTGAAGAATGTAGTTTCTTTGTGTATGAATACCATCAAAGAAATGAAGACTGGCCAAATGTTATTGGACTCTCAGGAAGGACGATCTTTGAATGCCTTAAATAGAACATGCAAGCTACCCGAACAAGTTGAGTCGAGTTCAACTGACAGCGATTGTGATGAAGAGATCCATCGCAATCTGAATCTTACTAAATCCATGACAAAACACCACTATCAATCGAAACGACCAGATTCAACGCCCAATGGACAAATAAGTCGACTTTTAAATGACAAAACTATTTTCAATAGTCGAAAGCCGTTAAAAAGCAGCACTCCTTTGCCGGAAAAACGTGTTAATGATATTATTAGGCTTACTTTTGACCAAGGAACGCAGACAGACCCAGTTCAACATCCTCCAATCAACAAGATAGAACAATTAGAAGTAGTTTTACGACGTCTCTATGGACAATTCCTTGCACTGATTGCTGACGTTCAACTGAAAGAAAACCAATGCGCAATCGGAGCGGCCGAGAAAAATTATTCAGAATCAAATGTAATAAAAGACTTTGTTAATACAGACACTCAATATTTAGCTAACCCTGATCTTGAATTAGACAAAAATAAAGCAAGAGTTCTTCAAGTTCGTAGAGCATCCGCTCAAACAACAAATACTGGTTTGTCGATGGTGGACAATAATACAGATATGGTATCCATTGGAAGCGGAAATGTTACCGTGCCAGCAAGGTTATGGGCTAATATGGATTGGACGTCTCATACCTCCGCAACGCGACAACTGCTGCAAGCTGTTTTCCCAAGGAGAGTTTTAGCTACCCATTCCCTAACTGGTAAACAATCACCAGCATTTGTGGACAAACCACCAAAACAGCAATTGGATCCAAAGCTCGTTGACGACATCGTCAGCACGGTATCAGAAAGATGTAGCGTTCCAAAAAGAATTGTAAGAAGCTCTATAACCACCAAATGTACTGATGAAGCAAAGCTGTACAGAAATCGCATGCTACAAAGGAAACGTGATCAACGTAATCCGGAAAATATATCACCGCTGGCCTCTTCTAATGAGTCTTCTAATGCAAGAGATTAA

Protein sequence:

>DPOGS200679-PA
MDPKLWLLVEYVEDQSAFSNYGVINTNNLIQHETDLHNGKVVFVRGKSNGARKAQILRISDSKRYVKDLKIMLERQDNQVKNVVSLCMNTIKEMKTGQMLLDSQEGRSLNALNRTCKLPEQVESSSTDSDCDEEIHRNLNLTKSMTKHHYQSKRPDSTPNGQISRLLNDKTIFNSRKPLKSSTPLPEKRVNDIIRLTFDQGTQTDPVQHPPINKIEQLEVVLRRLYGQFLALIADVQLKENQCAIGAAEKNYSESNVIKDFVNTDTQYLANPDLELDKNKARVLQVRRASAQTTNTGLSMVDNNTDMVSIGSGNVTVPARLWANMDWTSHTSATRQLLQAVFPRRVLATHSLTGKQSPAFVDKPPKQQLDPKLVDDIVSTVSERCSVPKRIVRSSITTKCTDEAKLYRNRMLQRKRDQRNPENISPLASSNESSNARD-
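
The transcript and protein lengths listed in this record can be checked against each other: a complete coding sequence that ends in a stop codon should be three bases per residue plus three bases for the stop. The following is a minimal sketch in Python, assuming the transcript is the full CDS including the terminal stop codon (the nucleotide sequence above ends in TAA and the protein sequence ends in "-"). The function name `cds_length_matches` is hypothetical and is not part of any database tooling.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if the transcript length equals three bases per codon.

    Assumes the transcript is a complete CDS with no UTRs; `has_stop`
    says whether a terminal stop codon is included in the transcript.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


# Values taken from this record: 1317 bp transcript, 438 aa protein.
print(cds_length_matches(1317, 438))  # True, since (438 + 1) * 3 = 1317
```

If the check fails for another record, the transcript may include untranslated regions or the gene model may be partial, so a mismatch is a prompt for inspection and not proof of an error.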