Monarch geneset OGS2.0

DPOGS215677
TranscriptDPOGS215677-TA2001 bp
ProteinDPOGS215677-PA666 aa
Genomic positionDPSCF300041 - 1007162-1023152
RNAseq coverage160x (Rank: top 52%)
Annotation
HeliconiusHMEL0040890.077.52% 
BombyxBGIBMGA003576-TA2e-6989.76% 
DrosophilaCG3345-PA7e-1432.20% 
EBI UniRef50UniRef50_UPI00021A6A509e-2338.65%UPI00021A6A50 related cluster n=1 Tax=unknown RepID=UPI00021A6A50
NCBI RefSeqXP_001869308.12e-2133.76%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|3407140083e-2238.65%PREDICTED: fibrous sheath-interacting protein 2-like [Bombus terrestris]
NCBI nr blastxgi|3228014981e-3127.06%hypothetical protein SINV_11627 [Solenopsis invicta]
Group
KEGG pathway 
Orthology groupMCL17336 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215677-TA
ATGTCGTCGTCGGCAACATCAGTGGAACTTGATGATATTCTGGACGTGGATACGACTTTGATGACTCTCAGGAAACCGACGGAATTGAAAAAAGTTATCGATTCTATACCTAAACCGATACGAGCTGTTCCAAAGAATGGCTTGCCTCTATGGTACAGACTGGGACTCGAGTGCAAATTGCCGGTGCCGAAAATGCCCGCGGGGAAAATCATATTCTCACGCGGAAAAATTGGAGAGGATGTTCGACGCCTCGGTCTTGGAACTAATGCACCTCGACCAACATTCGACCTCACAGACCCATACTGCAACAATGTATCATACGACTACGTGCCCATGCATGATCCCCACTTGGCACATCACTTTGCACAGAAACCAGCGAGGAATAGGATGAAATTGCTCGGCTATTGCACCAAAGACGGAAGAGCTGTCTGTTCATTGAAGGAGTTTAATCAGTATCGTAAATATTTATACAATCAATTCATGGACAGGATCCACATGGAAATGAAAATGTTGGATGAGAGAGCCCGTGACGACTTAACGCTGAAAAGAGTAGAGGGTGATGTAGCTAGGCGGTTGCAAGTCTTCACGAAGTCGGAACGAGCAAGGGAACATTTAGAAAAAGTTGCTCAGGAACACGCTGATGAATGGGCTGAGAAAAAAAGACAAGCGAAGGAACAGGAAAAGAAAATTCAAGCTAGAATGAAGTATTTGGCCGAGTTTAATGAAAAACAGCGCAAGGATCGAGCGGCGAGGGCTCATGAAAAACAGAATCGAATTAAGCAGAGAGTACAGGCCGCAGCAGAGATGGAATTGCGACGTAAGATAACTATGGTGAGACAATGGAGAACAAATGAGAAGAAGAGATTGAACCGACTGAAGCAGGAAAGAGAAGCTAAACAGAAACGGCTAGAAGATGAGGCCAATAGTAAATGGAAAGCGCGTGTGGAAGCTCAAAATCTGAAGATACAGGAGGAAGCGCTTCTTCTTAAATTATACACAGAAGACATGAAAGCAGCTGCTACGAGGCGGGCCAAGAAAGCAGAAATATACGCATATAACACTGATTTGGAGTTGCAACGCATCAGACTTGCAAACTTGAAACTTATGCACGGGGGAACGGCAAAAGCAGCGAAACTCGTCAGAAAAATGGGAATTGAATTTGAAAAGTCAAAACGAGGTGCCAAAGGTATGAGTCAAGAGTTGGCGCAGCGTTTGGCGGCTGAGGCCATGACTTCAGCGGTTTCTACGCAAGGAGATAGGCTCATGACTCTAGGACAGGCCAGAGCACGCATGGATCTGGAGACGGTGCTACCCACAGCGCCCATTAAACTGTTGGATGAGATGTTAGAGACTGCGATATTAGTGTTCTCGAGAAGACGAGTGGATACCTTATTGCGAGATATACAAAGGATGGTTCGCTCTAAATCGGCGTCGATATTCTTCCAAAGCATACGACCTCCCAGCCCGAAGAAACGATCAAAAACACCAAGATGGTGTATGCAAGTGGAAGCTGAGTTGACTAGGCATTACGCGAAGAACCACACGATAACTGCCTCTCAGAAACCGTCTATGACGAAACTGGCAGAAGTGACGTTCTCAGATCGCGTTGAAGATCTTCCCGATCCTGTTGCTATACCGACCTCCCTTCCTGAGAGAAGGAAGCTTCTAGAGATGGTGAATGTGGCAGGCAGATTGATGTCTCGATCGGTCATTCAGAGAGTTTTCAAGGGAATGGACTTGACAGTGGGAAAAGTGACGCTCCCAAACATGAGACACCCAATGGAGTGGGGTGAGGCCGTTGAGGGTCTGTCCCAAGCGGTGGTGAGCAGTCGTGTGAGCCCTGATTGTGCGGACGTTAGAAGAGCTAGCGAGTTCCTCGCGCACAGGATCCTGTGCTCGCTCATGAGAGAAATGAAGGAAGAGAAGGCGAAACTGAAATTAAACTTAAACTCGGAAGATAATAAAGATTTTTACACTCCTGGAAATGTCTCGGCTGGATAA

Protein sequence:

>DPOGS215677-PA
MSSSATSVELDDILDVDTTLMTLRKPTELKKVIDSIPKPIRAVPKNGLPLWYRLGLECKLPVPKMPAGKIIFSRGKIGEDVRRLGLGTNAPRPTFDLTDPYCNNVSYDYVPMHDPHLAHHFAQKPARNRMKLLGYCTKDGRAVCSLKEFNQYRKYLYNQFMDRIHMEMKMLDERARDDLTLKRVEGDVARRLQVFTKSERAREHLEKVAQEHADEWAEKKRQAKEQEKKIQARMKYLAEFNEKQRKDRAARAHEKQNRIKQRVQAAAEMELRRKITMVRQWRTNEKKRLNRLKQEREAKQKRLEDEANSKWKARVEAQNLKIQEEALLLKLYTEDMKAAATRRAKKAEIYAYNTDLELQRIRLANLKLMHGGTAKAAKLVRKMGIEFEKSKRGAKGMSQELAQRLAAEAMTSAVSTQGDRLMTLGQARARMDLETVLPTAPIKLLDEMLETAILVFSRRRVDTLLRDIQRMVRSKSASIFFQSIRPPSPKKRSKTPRWCMQVEAELTRHYAKNHTITASQKPSMTKLAEVTFSDRVEDLPDPVAIPTSLPERRKLLEMVNVAGRLMSRSVIQRVFKGMDLTVGKVTLPNMRHPMEWGEAVEGLSQAVVSSRVSPDCADVRRASEFLAHRILCSLMREMKEEKAKLKLNLNSEDNKDFYTPGNVSAG-