Monarch geneset OGS2.0

DPOGS211103
TranscriptDPOGS211103-TA1872 bp
ProteinDPOGS211103-PA623 aa
Genomic positionDPSCF300007 - 939984-942586
RNAseq coverage232x (Rank: top 44%)
Annotation
HeliconiusHMEL0124600.092.03% 
BombyxBGIBMGA002978-TA0.090.55% 
DrosophilaTM9SF4-PA0.073.14% 
EBI UniRef50UniRef50_Q9V3N60.073.14%GH02822p n=100 Tax=Bilateria RepID=Q9V3N6_DROME
NCBI RefSeqXP_001658596.10.076.07%transmembrane 9 superfamily protein member 4 [Aedes aegypti]
NCBI nr blastpgi|1571166540.076.07%transmembrane 9 superfamily protein member 4 [Aedes aegypti]
NCBI nr blastxgi|1571166540.076.32%transmembrane 9 superfamily protein member 4 [Aedes aegypti]
Group
Gene OntologyGO:00160211.1e-219integral to membrane
KEGG pathway 
InterPro domain[1-623] IPR0042400Nonaspanin (TM9SF)
Orthology groupMCL13159 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211103-TA
ATGTTCAAAGTTGAATTTCTGTTTTTAGTTAATTTAATTCTTTATAGTCATGGTTTTTATGTTCCTGGAGTGGCTCCAGTGGAGTTTAAAAAAGGTCAAAGAATAGAAGTTAAAGCAGTGAAAATGACAAGTATACATACGCAATTACCATACGAGTATTATTCCTTACCCTTATGTATTCCTAAAAATGGAACATTTATATACAAATCAGAAAATTTAGGCGAAGTTTTGAGAGGTGATCGTATTGTGAATACTAATTACGAAGTTCATATGGCAGAGAATATCAAATGCAAACTTTTGTGTCACAAAAGAAATAATCCAATGAACTGGAGCGTGGAGGAATCTGAGAAGGTTGCGAGTCGAATTGAACATGAATACTTTGTACATTTATTAGTTGATAACTTACCAGTTGCAACAAAGATTATAAATATTGATACTTCTGAAAGAACTATAGAACAAGGATATCGTCTAGGTTTTATGTCAAAAGGAAAGGCATATATAAATAACCATCTGAAGCTTCTTCTGAAATATCACAGACACAGTCAGGATTCTTACAGAGTTGTTGGCTTTGAAGTTGAGACATTTTCTGTGGACAAAGATCACTTGACATTCATTGATGATAACTATTGTCAAATTGGATCAGACATTAAACCACAACTTGTAAATGAGGATACGGGAACTAAATTGTACTTTACATATTCTGTGGAATGGGGAGAATCAGATATTGAATGGGCGTCAAGATGGGATATATATTTGGGCATGAAAGATGTTCAAATACATTGGTTTTCTATTGTCAATTCAATTGTTGTTCTTTTTTTCCTCTCAGGTATCCTAACTATGATAATGGTGAGAACACTCAGACGAGACATAGCTAAATACAATTCAGATGAAAATATTGAAGATATGATAGAAGAAACAGGTTGGAAGTTAGTTCACGGCGATGTCTTTAGACCGCCACCTAAAAGAATGCTTTTCGCAGCTGTTATAGGAAGCGGCATACAAATTTTCCTTATGGCCCTTATCACAATTTTCATTGCAATGCTTGGAATGCTGTCCCCTGCTAGTCGAGGTGCGCTTATGACATCTGCAATATTGTTGTATGTCTTTATGGGACTAATAGCTGGCTATTATTCAGCGAGATTGTACAACACAATGAAAGGCAAACAGTGGAAGCAAGCTGCATTTTTAACATCTACATTATACCCGGCTATTGTTTTTGGGACATGTTTCTTTTTAAATTTCTTCATTATGGGAAAACACTCCAGTGGCGCCGTGCCATTTTCGACGATGTTGGCACTTTTATGTCTGTGGTTCTGCATATCTGTACCTTTAGTGTATTTTGGTTATTATTTCGGATGTCGGAAACAACCATTTCAGCATCCAGTGCGTACAAACTTTATTCCGAGGAAAGTACCAGAACAAGTTTGGTATATGAACACATTAATTTGTATAATGATGGCCGGCATACTGCCATTTGGAGCTGTATTCATAGAATTATTTTTCATTTTCAATGCGATATGGGAGAATCAGTTCTATTACCTCTTTGGATTTTTATTTCTGGTTTTTTGCATACTTGTTGTATCTGTCTCCCAAATATCCATTGTAATGGTATACTTTCAACTCTGTGGCGAGGATTATCATTGGTGGTGGAAGAGCTTCATCATCTCCGGAGGATCTGCAGTTTATATTTTAATTTACTCAATATTTTACTTCTTCACAAAGTTAGAAATAACTGAATTTATACCAACATTACTTTATATTGGCTACACAGGTCTAATGGTACTGACATTCTGGCTTTTGACTGGGACTATTGGATTCTTTGCAGCTTATACATTCATCAGGAAAATCTATGCAGCAGTTAAAATTGATTAA

Protein sequence:

>DPOGS211103-PA
MFKVEFLFLVNLILYSHGFYVPGVAPVEFKKGQRIEVKAVKMTSIHTQLPYEYYSLPLCIPKNGTFIYKSENLGEVLRGDRIVNTNYEVHMAENIKCKLLCHKRNNPMNWSVEESEKVASRIEHEYFVHLLVDNLPVATKIINIDTSERTIEQGYRLGFMSKGKAYINNHLKLLLKYHRHSQDSYRVVGFEVETFSVDKDHLTFIDDNYCQIGSDIKPQLVNEDTGTKLYFTYSVEWGESDIEWASRWDIYLGMKDVQIHWFSIVNSIVVLFFLSGILTMIMVRTLRRDIAKYNSDENIEDMIEETGWKLVHGDVFRPPPKRMLFAAVIGSGIQIFLMALITIFIAMLGMLSPASRGALMTSAILLYVFMGLIAGYYSARLYNTMKGKQWKQAAFLTSTLYPAIVFGTCFFLNFFIMGKHSSGAVPFSTMLALLCLWFCISVPLVYFGYYFGCRKQPFQHPVRTNFIPRKVPEQVWYMNTLICIMMAGILPFGAVFIELFFIFNAIWENQFYYLFGFLFLVFCILVVSVSQISIVMVYFQLCGEDYHWWWKSFIISGGSAVYILIYSIFYFFTKLEITEFIPTLLYIGYTGLMVLTFWLLTGTIGFFAAYTFIRKIYAAVKID-