Monarch geneset OGS2.0

DPOGS215342
TranscriptDPOGS215342-TA2106 bp
ProteinDPOGS215342-PA701 aa
Genomic positionDPSCF300120 + 439229-441669
RNAseq coverage314x (Rank: top 36%)
Annotation
HeliconiusHMEL0086510.066.53% 
BombyxBGIBMGA007976-TA0.063.78% 
DrosophilaCG10914-PA3e-12844.36% 
EBI UniRef50UniRef50_UPI0001791D591e-14942.11%UPI0001791D59 related cluster n=1 Tax=unknown RepID=UPI0001791D59
NCBI RefSeqXP_001952820.12e-15042.11%PREDICTED: similar to CG10914 CG10914-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|3227995943e-15247.69%hypothetical protein SINV_10706 [Solenopsis invicta]
NCBI nr blastxgi|3123793641e-15243.45%hypothetical protein AND_08807 [Anopheles darlingi]
Group
KEGG pathway 
Orthology groupMCL11945 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215342-TA
ATGATATATACCGTGCGTAATTCTTTAAGGTTATTAAACAGATACAGTATAACTTGTATTAATGTTATTCGAACAAGTTGCACTGAAATTCCCATTAAACAGAAACCTGTTGAAAAAAAAGACTTCAAGATTCTACTGCAGAAATATAAAGATAAAATCGTGTTCAGTTCCTACTTAGAACATGAAAAGTTAGAACTTGGTTACTTCAAGTATCACATGAAAACTATCAAGAGAGCCAAACAGACTCAAGAAAAGGCTTTACAAGAAAAAAGTTTACCACCACTACCTCTAGCCTTAAGGTATTATGTTGATAAAGAAAGACTTCTAAACAATGAAAAAGAACCGGAGTCTGTTGCCTCGGACAAAACCTTTCAACTGCCTTTTGGGGAGACGACTCCCGTTGAAATCAATGAGCACGAAACGCAAATACAAATATCACAGGAAAATGAATTTAAAACAAACGTTGACCAATGGATGACAAACTATGAATACTTTGATGACAGTCATCTCACGTCTGGTGTAAACAATGACGATACTCATAGAGACTGGAGTAAATATTACGGCACTCCCGATCCCAATGCCAGCATCAGTCAAGTACGCTGCGGGGGCTGTGGAGCCCTGTTGCACTGTAATGACCCCGCCATACCCGGATACTTGCCAAGTGAAATATATACAGGACGGAAAGTAGAGGAACTCAAGACTATGGAATGTCAGCGATGTCATTTTCTTAAGGAATACAACATAGCTTTAGATGTCAACGTTCAAGCTGAGGAATATGAAAAACTTTTGCAATCCATCAGGTATGTTAAGTCTCTATTGCTTGTAATGGTGGACCTGCTCGATTTCCCCTGCTCCTTGTGGCCTGGCATAGTGGACATCATCGGCACTGAACGACCAGTCATCATAGTAGCTAATAAGGTGGATCTCCTGCCCGGGGATAGTGTGGGTTACTTAAAGAGAGTCAAAGAATGTTTAATGTCAGAAATCCAAAAAACTAAACTGGGTGAGGCCAACATTAAGTATATAGCATTGATATCAGCCAAAACTGGTTATGGGGTGGAAGACCTTATATCGGCCATGTTTAAAACTTGGCTCTATAAAGGAGACGTTTTCTTAGTGGGTTCTACAAATGTCGGGAAGAGTTCGTTATTTAACGCGCTTCTGCAATCCGATTACTGTAAAGTACATGCAGTTGATATCGTCAAAAGAGCAACTGTGAGTCGCTGGCCCGGCACTACTTTGAATCTCCTAAAATTTCCTATCAACAGACCATCCGGATGGAAAATAAGACAGAGGAGTCATAGACTGCTCACACAGAGAAAACTCATGAAGGTAGAAAAAGAAATAAGAAAAGGCCAAGTGTTGGGGCGGGATGCGACCGAGGCTCCGTCCCTGATAGGACACATCGGCAGAACCTTCTCACAGTATCAAGCATCCAGCGACAATACAGCGGACAAACGACATCAGTTGGTGGTTATCGACGAAAAGAATCCTCTTTTCAAAAAGAGCAAATGGCTGTACGACACTCCGGGGGTGGTGCTCTCCGATCAGATCCTCTCCCTGCTCACCACCGAGGAGCTCATGCTGGCCATACCGAGGAAACTGATAAGACCTCAGACCTACTATCTGCGCGAGGGCTCCACCTTCTTCATAGGAGGCCTGGCTCGAGTGGACCTCCTGGAGAGCGAGGACGCGTGTCGCTTCACCATTTTTTGCTCCGAGAGTCTGCCGATAACCGTGACGGAGACCAGGTTCGCAGACGAGGTGTATGATAGCTTCGTCGGCACCGAGCTGTTCGCGGTGCCGACTGGCGGCGACGAGCGACTTGAGAGGTGGCCGGGGCTGAGGAGGGGCGGCGACATGGAGTTCGAGGGAGAAGGACCAGACGTCTGTTGCGGAGACATAGTGCTGTCGTCAGCGGGCTGGGCCGCCGTCACGGGCAAGGCGGGCGGCCGGTGCCGGGTGGCGGCGTGGACTGTGGGCGGGCGGGGGCTGCACCGCCGCGTGCCCGCCCTGCTTCCGGCCGCCGTCACTCTCAAGGGCCGCCGCCTGCGTGACACACCCGCGTACATGATCGGAGGCGTGTTCACCGGAGACGAGCTCTGA

Protein sequence:

>DPOGS215342-PA
MIYTVRNSLRLLNRYSITCINVIRTSCTEIPIKQKPVEKKDFKILLQKYKDKIVFSSYLEHEKLELGYFKYHMKTIKRAKQTQEKALQEKSLPPLPLALRYYVDKERLLNNEKEPESVASDKTFQLPFGETTPVEINEHETQIQISQENEFKTNVDQWMTNYEYFDDSHLTSGVNNDDTHRDWSKYYGTPDPNASISQVRCGGCGALLHCNDPAIPGYLPSEIYTGRKVEELKTMECQRCHFLKEYNIALDVNVQAEEYEKLLQSIRYVKSLLLVMVDLLDFPCSLWPGIVDIIGTERPVIIVANKVDLLPGDSVGYLKRVKECLMSEIQKTKLGEANIKYIALISAKTGYGVEDLISAMFKTWLYKGDVFLVGSTNVGKSSLFNALLQSDYCKVHAVDIVKRATVSRWPGTTLNLLKFPINRPSGWKIRQRSHRLLTQRKLMKVEKEIRKGQVLGRDATEAPSLIGHIGRTFSQYQASSDNTADKRHQLVVIDEKNPLFKKSKWLYDTPGVVLSDQILSLLTTEELMLAIPRKLIRPQTYYLREGSTFFIGGLARVDLLESEDACRFTIFCSESLPITVTETRFADEVYDSFVGTELFAVPTGGDERLERWPGLRRGGDMEFEGEGPDVCCGDIVLSSAGWAAVTGKAGGRCRVAAWTVGGRGLHRRVPALLPAAVTLKGRRLRDTPAYMIGGVFTGDEL-