Monarch geneset OGS2.0

DPOGS209440
TranscriptDPOGS209440-TA2289 bp
ProteinDPOGS209440-PA762 aa
Genomic positionDPSCF300275 - 239236-245277
RNAseq coverage227x (Rank: top 44%)
Annotation
HeliconiusHMEL0044944e-12651.84% 
BombyxBGIBMGA005858-TA3e-11045.21% 
DrosophilaCG6495-PA2e-4643.28% 
EBI UniRef50UniRef50_E0VZG23e-6149.10%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VZG2_PEDHC
NCBI RefSeqXP_002431506.15e-6249.10%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420221541e-6049.10%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastxgi|1700365291e-6532.87%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00055151.6e-10protein binding
KEGG pathway 
InterPro domain[177-265] IPR0111061.2e-28Seven cysteines, N-terminal
[324-365] IPR0021721.6e-10Low-density lipoprotein (LDL) receptor class A repeat
Orthology groupMCL24770 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209440-TA
ATGTGTGGTAGCGAGCTAGTGAGGGAGCTCCCACACAACACGACACGAGGACACTGCAGGGGGAGGAGGGAAGGGGTCAGTACTCACACCACTCCTGAACTTGAAGAACAGCGACATAGGACGAGCGGCGAGCTGGTAGGCACTGACGAGGAGCCAGGAGTCTCTGAGTCTCAGAGCAGCCTCGGAGACGCGCCCGGACACCTCGCACTGACTGCCGCCGCCCGCTCGACGGTCGATAAGTGGAAAGTGTCAACTATGAAAGAAGTGACACGACCACATCGTCGGCCCGCTTGTTATCACACTGTGAGCGTTCCCTCCGCCCGCGCTGTGTCGCTGTGTCGCGAGTCAGCCCGCATGCGCGCTATTATCGATCACAGAACATACAAAGCATCAGAGATAGTGTCAAAAGTATCAAATGTCTGTGTGGATACCAGTGTAGGGGAAGAGACCAGTTACATGACAGTGATTGTGATGGGTACACGGGCCTGGGTGTGGTGGCTGGTGGTGGGGGCCGCGTGCTCCCGGGAACTGGACCCCGGCTCGTGCGTCGCGCGTTTCGACGTCCAGCGAGATAAGATCATTCGCACGGAGGAGTCCCGGGAGATGGGGGCGAGGTATCTCTCCGAGCTCGACGTGGGCGCGCGGTCCGAGTGTCTGCGGCTCTGCTGTGAGACAGACTCCTGTGACGTCTTCGTCTTCGAAGAGAAGAGTCCTGGCAGCTGCTACCTGTTCAGCTGCGGCCCGCCCGAGGACTTCCGCTGTAAGTTCACGGCTCACGGGAACTTCAGCAGCGGCGTGCTGGCCATCAGCCGCCGCCTCGCCGAGCTTCAGGACAGTGAGCGACTCGCAACGCACGAGAGAGAACTGGCCAACTTACGGAACCCCACCAGCACGACACTGACCACGACCCTGGCGGCGCGCAGGACGGAGCCTCCGCCGCCCCAGCCGCCCCAGTCGCCCACAGCTAGACCCTGCAGTCGCTACCAGTTCGAGTGTCGTCGCGGCGGCGAGTGTATCGCCGTGTACAACGCCTGCGACGGCGTGCCGCAGTGCGCGGACGGCAGTGACGAGGCGCCCGAGCTGGACTGTCCTACGTCACCTCCGCCTCCGTCACCGCCGGCGTCTCCCCTGCCGCCCGCTCCGCGTATGCCGCTCTCGCCGCCCTCCACGACCTTAGCCTCGGTCGTCAGCAGACCGCCGCCGCCGCCCCTCACACAGGCTTTATCAGACAACCGGTTCACGGGGGTCGTGCCGGCAGAGGGCGACTTGCTCGACGGAGCGGAGTCCTGGCCGAGGCGTCTGGCGCAGGCTCAGCCGCATCGATACGCGGGGGGCGGGGCGGCCGGAGCGGGCGGATCGCACATCTTCAGTCACAAAGGCGGACTGCTCCAAGAGAGCTCCTTCGACCCGGCGCTGCCTCCCGCCTGGCCGCGCCGGCCCTGGACTCCGCAGCTGACGGCTCCGGACATGGAGTCGGAGCGCGCGTGGGGCGGGTACGGACGCGAATGGCCCGACGTGGAGCCCCGACGAGCTTGGCCCGTGTCGCGGCCAGAGACCGACGTACAAATGTATTTATCTTCTAAAACTATGCCGGAAATGCCCATGATATATTCGAGTCAACAGCAGACGCTACGGGACGGACCCAAAAAAGGCTTCGAGATAAGGAACAAAAACGCATTGGATTTCCCGAGCGAGCCTCCGCCGCGCCGTCTGCCCCCCTCCACGCTCGACCTGAGAGAGGAGAACAGGCCTATAGTCTTAGAAAACACGACGAAGAAGAAGCTTCTAAAAGAAAAGGAGGTGGATTTAAACAACGAACATCAAAATGTAGTAGCTGGCCGCGTCCCGGCTCCGAGCGGCGCAGAGGCTGCGAGGGCCGCGGGGGGCGGCGGGAGTGCGGGGGTCGCGAGGGAGCCCGCCGCCCTGCAGGCTCGCACGCGCTGGGCCAACGACGAGCACGACGGTCTCAGCGAGCACCCTCCCGCCGCCGTGCTGCTGCTGGTTCTCGGTACAGCGCCTCCCCAGCAACTTCCCTCACCGCCCTCTGGCCGCTCCCCCTCACCCCCTCCCCTGTGTTCCAGGCTCGCTGATGACGGCCTGCCTGGCGGGCCTGGCCGTGTGCCGCGCGCGAGCCTCCCGCCGCCGCCGCCGCTCTCACCCGCGCCTGGCGCTGGACGCCGATTACCTCGTCAACGGCATGTACCTGTAGCGGCTCGAGGCCTGGGGAGGTCGGGGAGCTGGGATCGCAGCTGTAGACTTTCGTTATATACGTTAATATTAATTCACTAA

Protein sequence:

>DPOGS209440-PA
MCGSELVRELPHNTTRGHCRGRREGVSTHTTPELEEQRHRTSGELVGTDEEPGVSESQSSLGDAPGHLALTAAARSTVDKWKVSTMKEVTRPHRRPACYHTVSVPSARAVSLCRESARMRAIIDHRTYKASEIVSKVSNVCVDTSVGEETSYMTVIVMGTRAWVWWLVVGAACSRELDPGSCVARFDVQRDKIIRTEESREMGARYLSELDVGARSECLRLCCETDSCDVFVFEEKSPGSCYLFSCGPPEDFRCKFTAHGNFSSGVLAISRRLAELQDSERLATHERELANLRNPTSTTLTTTLAARRTEPPPPQPPQSPTARPCSRYQFECRRGGECIAVYNACDGVPQCADGSDEAPELDCPTSPPPPSPPASPLPPAPRMPLSPPSTTLASVVSRPPPPPLTQALSDNRFTGVVPAEGDLLDGAESWPRRLAQAQPHRYAGGGAAGAGGSHIFSHKGGLLQESSFDPALPPAWPRRPWTPQLTAPDMESERAWGGYGREWPDVEPRRAWPVSRPETDVQMYLSSKTMPEMPMIYSSQQQTLRDGPKKGFEIRNKNALDFPSEPPPRRLPPSTLDLREENRPIVLENTTKKKLLKEKEVDLNNEHQNVVAGRVPAPSGAEAARAAGGGGSAGVAREPAALQARTRWANDEHDGLSEHPPAAVLLLVLGTAPPQQLPSPPSGRSPSPPPLCSRLADDGLPGGPGRVPRASLPPPPPLSPAPGAGRRLPRQRHVPVAARGLGRSGSWDRSCRLSLYTLILIH-