Monarch geneset OGS2.0

DPOGS209131
TranscriptDPOGS209131-TA1992 bp
ProteinDPOGS209131-PA663 aa
Genomic positionDPSCF300061 - 1202762-1209945
RNAseq coverage1502x (Rank: top 8%)
Annotation
HeliconiusHMEL0081430.076.17% 
BombyxBGIBMGA001842-TA0.069.13% 
DrosophilaCG33129-PE2e-7027.18% 
EBI UniRef50UniRef50_UPI00022470A95e-10334.91%UPI00022470A9 related cluster n=1 Tax=unknown RepID=UPI00022470A9
NCBI RefSeqXP_001601024.17e-10033.48%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3454897362e-10234.91%PREDICTED: transmembrane protein 214-like [Nasonia vitripennis]
NCBI nr blastxgi|3454897363e-11835.64%PREDICTED: transmembrane protein 214-like [Nasonia vitripennis]
Group
KEGG pathway 
InterPro domain[195-618] IPR0193081.8e-60Protein of unknown function DUF2359, TMEM214
Orthology groupMCL14844 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209131-TA
ATGTCTAGCGGTCAGTGGGAAGTTGTCGGTAAAAATAAGAAATCCCAAAACGGTAAAGTAAAAAATGTTAAAGAAGAAGAGAAGAAACCAACTAAGAATGGACCGAAACTTGAAGATGTCGTTCCCCATTCCCAATTAAAACAATATTACAGTGGAATGGAAATCGACGATACTCAAAAACCGCAAAATAACAAAAAGAATGCCGATAAAAAGAAAAAGCAAGATAAAAAATCCGAACCCCCTAAACCTAAGCTTCCTAAAACGATCGAAGAAGCTTTAGAAATGATTGATCTATCGGAGCTTGGAAGCATCATAACCACAAACAAACTTAGATTTTCCAATGCCCCTCTGGTCTGGTTGAAGGAAGTAGCCAATTATCTTAATTCAAAAATTCCCATTGATGTTGAGGATCCAACATTTTTGCATAACAATGCTGGCTACCCGCTATCTGCTGCTCCTTTAGAAGTCATCAAACTACTAGAAAATGTTCTCCATGACGCCGGAAAGGCGAACACGCAGTTATTTTTCGATGTATCCTTAACAGCGCTTGCTAATGATATGAGTCGAGGTCAGTCTGTCAATGGTTACAGATTGTTGCTTCAAATATTGGCTCAAAAGTATCCAGATTTTTGTCTTGTTTCTTTACCTAAAAGTATAAGTTTAAGAAACTCCTATCAAAATCGACCGCCTATTGGTCTGTCCTTGCTTTGGACTTTAGGCCAAGGCGGCTTTAACAACTTTGCTGTGGGTTTAAAAGTCTGGCAAGACTTGTTCTTCCCTCTAATAGAATTGAAGAATTACTCGAAATATGTGATATTATACTTATGTGAGATTCTTAACAAGCCCGCTGTGATGGATAGTACAAAAGTGACCCAGGACCAACTCCTGGCCATGTTCGATATGGTGAATGGCAAACGTAATTCACTATCAAAGGACCTGTCAAGTGACCTTATCAAGCAATTAAGTAAATATAAGGATATATATTTTAAACATAGCGGTAATAAGCTCCAGGTTACTTTTAACCAGCTAATGAAGAAACTGCCAAACCAGTACTTGAGTGGTAACACACTTGATAGTTACAATAAGGTCATAGTTGAAAGTCTTATTGACTGTTTGCGGTTAGACGATTCCTGCAACGCGACATGGAGACAGTTGTTCAATAGATGCAGCAAGCAATCGGCAACTGTTATTGAGTATATAGATACGAATTGGGACGAAGTTAGTCCGAGGTTGAAAAAGAAATCCTTGAAGGCGACCGTACTGCAGTTCAAGGAAGTCTGTGGTGAGACACTGAAGGGCAAGAAAAAAGATGAAACAGTCGTCAAAGCTAACAAGATCTGCCAGGATATTCTAGACAGAATGACGAGCACTAGAAGATTCCCGTGGGTTTGGGCCAGTTTGTTGGTGCTGATCAGTATTGGTGGGCTGATTGCATACGATGTATCCAGGGTCGGAGGAGATTTCCCCAAGAGTGCGACTGGTAAACTGATGAATGACTTGGGTATTCTTGAACAAAGTCAGAGAGCATGGCAAAAGGGTCTGTCTGTGTCGGCCCGGGGTTATCTGTGGATGGAAAGCAATGCACCAGTGTATTATGCCCAAACCGTGGAAGCCTGCAAGCCATATGCGCAGCTCTCCAAAGATGCAGTCACCATTGCCGTTAAGAAGCTTGGAATCCTATATGTCAATATGAAGGAATACGTTGTTGACAAGACACCTATAGTTGTGGCAACTATAGAACAGTATGCACCCGGCGTTGTTGAGACTGTTCAAAGTTACGCTGTCAATGGTTTCGCCGCTGTTAAGAAATATTCAAATGACTACTATCAAATGACAGTTGATTATTTATCGACTAAGGTCTTTGTAGGTGATTGGGCTCCGGAGATCCTTCACAACAAGACTCAGTTGGCCTTAAACGCTACAAAGCTGCACATGTGTACATATTTCCACTGGTTCAGAGAGCAGGTGAATGTGTACTCCAAGATACCGTGA

Protein sequence:

>DPOGS209131-PA
MSSGQWEVVGKNKKSQNGKVKNVKEEEKKPTKNGPKLEDVVPHSQLKQYYSGMEIDDTQKPQNNKKNADKKKKQDKKSEPPKPKLPKTIEEALEMIDLSELGSIITTNKLRFSNAPLVWLKEVANYLNSKIPIDVEDPTFLHNNAGYPLSAAPLEVIKLLENVLHDAGKANTQLFFDVSLTALANDMSRGQSVNGYRLLLQILAQKYPDFCLVSLPKSISLRNSYQNRPPIGLSLLWTLGQGGFNNFAVGLKVWQDLFFPLIELKNYSKYVILYLCEILNKPAVMDSTKVTQDQLLAMFDMVNGKRNSLSKDLSSDLIKQLSKYKDIYFKHSGNKLQVTFNQLMKKLPNQYLSGNTLDSYNKVIVESLIDCLRLDDSCNATWRQLFNRCSKQSATVIEYIDTNWDEVSPRLKKKSLKATVLQFKEVCGETLKGKKKDETVVKANKICQDILDRMTSTRRFPWVWASLLVLISIGGLIAYDVSRVGGDFPKSATGKLMNDLGILEQSQRAWQKGLSVSARGYLWMESNAPVYYAQTVEACKPYAQLSKDAVTIAVKKLGILYVNMKEYVVDKTPIVVATIEQYAPGVVETVQSYAVNGFAAVKKYSNDYYQMTVDYLSTKVFVGDWAPEILHNKTQLALNATKLHMCTYFHWFREQVNVYSKIP-