Monarch geneset OGS2.0

DPOGS215597
TranscriptDPOGS215597-TA1410 bp
ProteinDPOGS215597-PA469 aa
Genomic positionDPSCF300097 + 260913-281997
RNAseq coverage15x (Rank: top 82%)
Annotation
HeliconiusHMEL0169216e-15089.83% 
BombyxBGIBMGA008820-TA2e-14780.06% 
Drosophilabves-PB5e-3336.04% 
EBI UniRef50UniRef50_D6WJ445e-10051.92%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WJ44_TRICA
NCBI RefSeqXP_970119.16e-9651.34%PREDICTED: similar to blood vessel epicardial substance [Tribolium castaneum]
NCBI nr blastpgi|2700072432e-9951.92%hypothetical protein TcasGA2_TC013795 [Tribolium castaneum]
NCBI nr blastxgi|2700072432e-9950.00%hypothetical protein TcasGA2_TC013795 [Tribolium castaneum]
Group
Gene OntologyGO:00160203.3e-55membrane
KEGG pathway 
InterPro domain[92-349] IPR0069163.3e-55Popeye protein
[224-335] IPR0147102.1e-06RmlC-like jelly roll fold
Orthology groupMCL18341 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215597-TA
ATGAAAGTGCAAGATATGCGGACGTTCCGTGAACATCACCCTACCAGTATCACCACCGCCACAAGTGGGCGGAGAGACGGGACGCTTCCTCGGCTGCTCAGAGCCCAGCTGAATCGTCTCGTGTCGAGTGAGCGGCGAGGCGTCAGGGGGGAAAGGAAGAGGGGGGGCGAGGGGGCAGCCGGCGCGCGGTCGTCGGCGCGCGAGCGCGGCATGGCGCCCGCCCGACCAGCGCCGGCGACCGCCGCGCCCTGGCTGAACGCCACGCTGGCATACAACCTAACCCGTATCAACACCACCTTTGATACCGATTACGAACTACTCAACGCCAGCACCACCGATGAACACGACAACCACTGGTTCTGCGCTAAGTGGACCAGCGCACAGCAAGACCTCTTTCAGGCTGCAAATCTCTGCTTCGCGATCGCGTTTCTTGCACCGAAGAGTTTCAAGCAGAGCATCTTGGTGCTACGCGCGCTAGCAGCTGCGGGTGCAGTTTTGATGGGCATGTGGGCCGGCGCAGAGGTTTGCGCACCAGACGTCCTAGCCTGGAGTTTGGCACTGGTTCTAGTCAACTCTATACATACCATCTTCTTAATAATAAGGTTTCTTCCACCGGCGTTGTCATTAGAGCTGACAGATTTATACCTGAAACTATTCAAGCCTCTGAAAGTGAATAAGAAGCACTTTCAGGAGTTGACTCGTGAAGCCCGTGTCATCAGGCTTGAGCCGGGCGAGGCGTACGCTGTCGAAGAAGTCACGCCTGCGGACGAAAGACTGTCGATATTATTAAAGGGAAAAATGCGAGTGACTTGCGATGAGACCCACCTTCATTATATACAGCCCTACCAGTTCGTAGACTCGCCCGAGTGGGAAGCGAACCGTGAACAGTCTGACGACGTCTTTCAGGTGACAGTGATAGCTGAGGAGGTATGCACGTGCGTGTGTTGGCCGCGCATGCGCCTGGAGCGAGTGCTGCGTCACCGCCCTGCACTCAAGGTCGTTCTCGACTGTCTCATAGGTAAAGATATAACGCACAAGCTATATTCCGTGAGCGAGGGTCTCGGTGCAGGAGCGACAGGTGAAGGAGAAGGCCCGCCTGCACACCTCAAGCGCTCCGCTAGTGTCGATGCTGTTCATGAAGGGGCCCGCGGAAGACTAAGGAGCTTGGCTTGGAGAGCCAGATCCACTTCTAAGAAAGGAAGCTCATACTGGCAGCCTGTAGTCGCTCGCCAGTTCCTAAGACAGTCCCCGTTCGGTCGCAGCGTGGTACCTCCGCGGCTGCTGGCGCAGGCGTCATCCGCCAGTCTGCAGGCTCCCGCGCGTCGTAGTCCTCCGCGCCGGACGGCCTCTTTCCGCCGCAGCCGGGAAGTCAAGTTCGCGGAGGTGCCGACAGACGAGCCGGCCGTGTGA

Protein sequence:

>DPOGS215597-PA
MKVQDMRTFREHHPTSITTATSGRRDGTLPRLLRAQLNRLVSSERRGVRGERKRGGEGAAGARSSARERGMAPARPAPATAAPWLNATLAYNLTRINTTFDTDYELLNASTTDEHDNHWFCAKWTSAQQDLFQAANLCFAIAFLAPKSFKQSILVLRALAAAGAVLMGMWAGAEVCAPDVLAWSLALVLVNSIHTIFLIIRFLPPALSLELTDLYLKLFKPLKVNKKHFQELTREARVIRLEPGEAYAVEEVTPADERLSILLKGKMRVTCDETHLHYIQPYQFVDSPEWEANREQSDDVFQVTVIAEEVCTCVCWPRMRLERVLRHRPALKVVLDCLIGKDITHKLYSVSEGLGAGATGEGEGPPAHLKRSASVDAVHEGARGRLRSLAWRARSTSKKGSSYWQPVVARQFLRQSPFGRSVVPPRLLAQASSASLQAPARRSPPRRTASFRRSREVKFAEVPTDEPAV-