Monarch geneset OGS2.0

DPOGS214847
Transcript: DPOGS214847-TA, 3681 bp
Protein: DPOGS214847-PA, 1226 aa
Genomic position: DPSCF300091 - 190430-206060
RNAseq coverage: 1244x (Rank: top 10%)
Annotation
Heliconius: HMEL007018, 2e-138, 66.60%
Bombyx: BGIBMGA010015-TA, 1e-21, 51.79%
Drosophila: ens-PF, 8e-17, 40.44%
EBI UniRef50: UniRef50_Q6H236, 1e-74, 32.03%, Paternally-expressed gene 3 protein n=7 Tax=Eutheria RepID=PEG3_BOVIN
NCBI RefSeq: XP_001625722.1, 3e-74, 26.48%, predicted protein [Nematostella vectensis]
NCBI nr blastp: gi|296477121, 3e-74, 32.03%, paternally expressed 3 [Bos taurus]
NCBI nr blastx: gi|296477121, 7e-73, 29.12%, paternally expressed 3 [Bos taurus]
Group
KEGG pathway: bta:280687, 1e-09
    K03902 (F5) maps-> Complement and coagulation cascades
Orthology group: MCL25680, Lepidoptera specific

Nucleotide sequence:

>DPOGS214847-TA
ATGATAAACATTTGTTTCCATACTGTAGACTGTTCTCACATTAAAACATTTGTATACACAGAGGCCTTAGATCGTGCGAGGTTGGCCCGTGAGCGGCGAGAGGCCGAACAACGGAAACGCCTCGACGAGCTGAGAGCTCATGCTGCGGCAGCCCAGGCTCAGAGGGAGAAGAGGGATGAGGATCGGAGGCGGAGGATGGCCGAGGCTCGCTCCAAGGACGAGGATAGGAGGACCCAGGTTGAGGAGCGTAAGCGCGCTATATGGGAGGCGTCTGTGTCCAGGCGTGAGGCTCTCCTCCAGCGAGAGCGTGATCGTTCTGAGCGCTTGGAGCGTGCCCGGGCCGCACGCTCGTCTCCGCGGCCGGCCTTCGCCTTCGGGTCCTCCACACCCAGGCTCCTAGAGCCTGTCGACTCTGCCGGCTTCTTCTGGGCAGCGCGCAGTACCAACCCCATAGATCAGATGAACTTTTTTGAATCTCTGGCGGCATCAACGAACAATGTGATGTTTGCATCTGCGCCGCTAACACGTCGTGCGTCCGCTCTTCAGCTTGACACATCCGATACTGACAATAAAGACGAGCCGGTGTCCCCTCAGCCGCTGTGGTCGTCCGTGGCCCGTCGAAGGACCGACCTGGTGCCCACCCTCCCCGCCCGTCCAGGTAGAGCATACTCCATGACGCGCCTCGACAGGATCTCCAGCACTCCGGCCTCCCCCGCCTCGCCCGCGGCGAGGCACAGCCGCAGCCAGACCCGGTCCGGGGAGGCGACCCCGTCCCGTCCCGGCAGCTCGCTGTCGTCGGCGGCCCCGGTGCTTCGGCGCGCGTCCTCGGCGCCCCGCAAGCCGCGCCCCGCCTCCATAGCCGGTACGGGTGTTAACACGCCCCGAGGTGCGGACACGGGTGCCAACAGTGCGTCCACGACCCCGTCCGTGTCCCGGGACGCCACCGTGGTGTCCCGCCCCGCCGCGCCCCGCAGACCGCGGCCGCTCTCGCTACACGCGCCCGCCAAGAACGTTCCATCCACCCCGACGGCTCCGGGTGAGGGCAAGCCGCCGCTACATAGGAAGCCAAAGCCATCCAAAGAACATGACAAGAATCGCAAGTCTATGTCTATATCTCCGGGGCGAGGCTTGTCTCCGTCCAAGGATTCCGCGGACAAAGATCTTATGACCAAGAGCGTGGGCGGAGATAAGAGTGTGGTCAACGCGTTCGAGGTGACACAGAAGTTGGAAGCGCTGAGGATTGATTCACAGCAAGCAAGCGAAAACAAAAGCGTAGAAAATAAGCATGTAACGAATAACGAACAAATCATTGAACAGAAAGAGAAGATGGACGTACAGAAGCCAGAGAAGATGACAGAAGATAAAGAGGAGAAGAAGGAAAAGAGGGAGGAGAGAGAGGAGAAGAAAGATAAGAAGGAGCCGTCACAGGAAAAGACCGAGGACAATGAAATGACTGGTGTGACGCAGGAGAATGTTAACACAAGTGTGACGCAGGAGAATGTTAACACTGGTGTGACACAGGAGGATGTCAACAAAAGTTTAACACTGGAGAATGTTAACAAAAGTTTAACACAGGAGAATGTTAATACAGGTGTGACGCAGGAGAATGTTAACACAAGTGTGACGCAGGAGAATGTTAACACTGGTGTGACACAGGAGGATGTCAACAAAAGTTTAACACTGGAGAATGTTAACAAAAGTTTAACACAGGAGAATGTTAATACAGGTGTGACGCAGGAGAATGTTAACACAAGTGTGACGCAGGAGAATGTTAACACTGGTGTGACACAGGAGGATGTCAACAAAAGTTTAACACTGGAGAATGTTAACAAAAGTTTAACACAGGAGAATGTTAATACAGGTGTGACGCAGGAGAATGTTAACACAAGTGTGACGCAGGAGAATGTTAACACTGGTGTGACACAGGAGGATGTCAACAAAAGTTTAACACTGGAGAATGTTAACAAAAGTTTAACACAGGAGAATGTTAATACAGGTGT
GACGCAGGAGAATGTTAACACAAGTGTGACGCAGGAGAATGTTAACACTGGTGTGACACAGGAGGATGTCAACAAAAGTTTAACACTGGAGAATGTTAACAAAAGTTTAACACAGGAGAATGTTAATACAGGTGTGACGCAGGAGAATGTTAACACAAGTGTGACGCAGGAGAATGTTAACACTGGTGTGACACAGGAGGATGTCAACAAAAGTTTAACACTGGAGAATGTTAACAAAAGTTTAACACAGGAGAATGTTAATACAGGTGTGACGCAGGAGAATGTTAACACAAGTGTGACGCAGGAGAATGTTAACACTGGTGTGACACAGGAGGATGTCAACAAAAGTTTAACACTGGAGAATGTTAACAAAAGTTTAACACAGGAGAATGTTAATACAGGTGTGACGCAGGAGAATGTTAACACAAGTGTGACGCAGGAGAATGTTAACACTGGTGTGACACAGGAGGATGTCAACAAAAGTTTAACACTGGAGAATGTTAACAAAAGTTTAACACAGGAGAATGTTAATACAGGTGTGACGCAGGAGAATGTTAACACAAGTGTGACGCAGGAGAATGTTAACACTGGTGTGACACAGGAGGATGTCAACAAAAGTTTAACACTGGAGAATGTTAACAAAAGTTTAACACAGGAGAATGTTAATACAGGTGTGACGCAGGAGAATGTTAACACAAGTGTGACGCAGGAGAATGTTAACACTGGTGTGACACAGGAGGATGTCAACAAAAGTTTAACACTGGAGAATGTTAACAAAAGTTTAACACAGGAGAATGTTAATACAGGTGTGACGCAGGAGAATGTTAACACAAGTGTGACGCAGGAGAATGTTAACACTGGTGTGACACAGGAGGATGTCAACAAAAGTTTAACACTGGAGAATGTTAACAAAAGTTTAACACAGGAGAATGTTAATACAGGTGTGACGCAGGAGAATGTTAACACAAGTGTGACGCAGGAGAATGTTAACACTGGTGTGACACAGGAGGATGTCAACAAAAGTTTAACACTGGAGAATGTTAACAAAAGTTTAACACAGGAGAATGTTAATACAGGTGTGACGCAGGAGAATGTTAACACAAGTGTAATACAAGAGAATGTTAACACAAGTATAACACAGGAGAGCGTTAACAAAAGTGTGACGCAAGAAAGTGTAACAAGTGTGAAAGGAAGTCAAGACATGATAGTCGATACCACGCCAGCGATACCAAATGAAAGTGTAACGAGTGCAATTATACCAAATAAAGACGGAAGTGAAAAATCGAGGGATGACGCAGGGAGTGGTAGAAAAAGTAACACGACAAGCGAAGGAAGTGTCGGAAATGTTGCCAACACTGATAGCAGTAACGGAAACGTGCAGGGGGGGGAGGCACAGAGTCTAGGTAGTCATAGCGTGTGTCCGTCCGGTGCTCCGACTAACAACGGGAATGTGGCTGATCTACTCGGCTTGTCCACGCAAATGGAAACGGTGGACAAACCGGAGGAGCAAACGCCGGCGAAGACAATCGACAACACGCCGACACTGGACACCAACAACGGGAACACGAACGTCGGCCACGTGTCGGCCAACTTCATACAGTCGGAGCGGATGGCCGACGACTTCACCACGCACAACGGCCACGGTCACAACCATCCGCTCACACTGCAAAATGCGATCTAA

Protein sequence:

>DPOGS214847-PA
MINICFHTVDCSHIKTFVYTEALDRARLARERREAEQRKRLDELRAHAAAAQAQREKRDEDRRRRMAEARSKDEDRRTQVEERKRAIWEASVSRREALLQRERDRSERLERARAARSSPRPAFAFGSSTPRLLEPVDSAGFFWAARSTNPIDQMNFFESLAASTNNVMFASAPLTRRASALQLDTSDTDNKDEPVSPQPLWSSVARRRTDLVPTLPARPGRAYSMTRLDRISSTPASPASPAARHSRSQTRSGEATPSRPGSSLSSAAPVLRRASSAPRKPRPASIAGTGVNTPRGADTGANSASTTPSVSRDATVVSRPAAPRRPRPLSLHAPAKNVPSTPTAPGEGKPPLHRKPKPSKEHDKNRKSMSISPGRGLSPSKDSADKDLMTKSVGGDKSVVNAFEVTQKLEALRIDSQQASENKSVENKHVTNNEQIIEQKEKMDVQKPEKMTEDKEEKKEKREEREEKKDKKEPSQEKTEDNEMTGVTQENVNTSVTQENVNTGVTQEDVNKSLTLENVNKSLTQENVNTGVTQENVNTSVTQENVNTGVTQEDVNKSLTLENVNKSLTQENVNTGVTQENVNTSVTQENVNTGVTQEDVNKSLTLENVNKSLTQENVNTGVTQENVNTSVTQENVNTGVTQEDVNKSLTLENVNKSLTQENVNTGVTQENVNTSVTQENVNTGVTQEDVNKSLTLENVNKSLTQENVNTGVTQENVNTSVTQENVNTGVTQEDVNKSLTLENVNKSLTQENVNTGVTQENVNTSVTQENVNTGVTQEDVNKSLTLENVNKSLTQENVNTGVTQENVNTSVTQENVNTGVTQEDVNKSLTLENVNKSLTQENVNTGVTQENVNTSVTQENVNTGVTQEDVNKSLTLENVNKSLTQENVNTGVTQENVNTSVTQENVNTGVTQEDVNKSLTLENVNKSLTQENVNTGVTQENVNTSVTQENVNTGVTQEDVNKSLTLENVNKSLTQENVNTGVTQENVNTSVTQENVNTGVTQEDVNKSLTLENVNKSLTQENVNTGVTQENVNTSVIQENVNTSITQESVNKSVTQESVTSVKGSQDMIVDTTPAIPNESVTSAIIPNKDGSEKSRDDAGSGRKSNTTSEGSVGNVANTDSSNGNVQGGEAQSLGSHSVCPSGAPTNNGNVADLLGLSTQMETVDKPEEQTPAKTIDNTPTLDTNNGNTNVGHVSANFIQSERMADDFTTHNGHGHNHPLTLQNAI-
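
The two FASTA records above can be cross-checked against each other and against the stated lengths (3681 bp, 1226 aa). The following is a minimal sketch, not part of the original record: the helper names `read_fasta` and `translate` are hypothetical, the standard genetic code is assumed, and the stop codon is written as "-" to match the trailing "-" in the protein record. A short toy sequence (the first six codons of DPOGS214847-TA plus a stop codon) stands in for the full transcript.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )
    return protein.replace("*", stop_symbol)


# Toy record: first six codons of the transcript above, then a stop codon,
# wrapped over two lines the way the full sequence is.
toy = ">toy-TA\nATGATAAACATT\nTGTTTCTAA\n"
toy_cds = read_fasta(toy)["toy-TA"]
print(translate(toy_cds))  # MINICF-

# Stated lengths are consistent: 1226 residues plus one stop codon.
print(3 * (1226 + 1))  # 3681
```

To check the full record, the same two functions could be applied to the complete DPOGS214847-TA and DPOGS214847-PA entries and the translated string compared with the protein sequence.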