Monarch geneset OGS2.0

DPOGS208744
TranscriptDPOGS208744-TA1257 bp
ProteinDPOGS208744-PA418 aa
Genomic positionDPSCF300043 + 416899-424341
RNAseq coverage71x (Rank: top 66%)
Annotation
HeliconiusHMEL0152307e-8470.44% 
BombyxBGIBMGA003406-TA9e-5769.33% 
DrosophilaCG10407-PA9e-2331.64% 
EBI UniRef50UniRef50_G9F9L62e-7367.71%Seminal fluid protein CSSFP066 (Fragment) n=1 Tax=Chilo suppressalis RepID=G9F9L6_9NEOP
NCBI RefSeqNP_001155387.15e-2838.33%hypothetical protein LOC100159064 [Acyrthosiphon pisum]
NCBI nr blastpgi|3640236798e-7367.71%seminal fluid protein CSSFP066 [Chilo suppressalis]
NCBI nr blastxgi|3640236799e-7167.71%seminal fluid protein CSSFP066 [Chilo suppressalis]
Group
KEGG pathway 
InterPro domain[4-186] IPR0105623e-41Haemolymph juvenile hormone binding
Orthology groupMCL18815 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208744-TA
ATGAAGATCATTTTGTTAATTGTCACCTGTTCCGTCGCAGCTTCAGCTAAGATATTACCTGACGATTTTCCGCAATGCAAGAGGAACGATCCAGAATTGGAGAAGTGTATACTAGCAGCAGTCGAAGTAGTCAGACCGAGGCTGTTAAACGGCATCCCTGAAGTCAACATACCAACACTCGAGCCGTTCAATGTGCCAACATTGAAGCTGGACAGGACTGCCAACAATCTGAGATTGAAAGCCAACATTAAAAATATGAAAGCCGTCGGTGGATCCAAATTTACAATTGAAAAATTCAGATTGAACTTGAACAATAAGTACGTGGCGGAAATCAAACTAAGCATCCCGAAGCTGGTGGTAACTGCTGACTACGACGTGAAGGGATCCCGTATCCTCACCCTGGACATCAGTGGCCAGGGAAGATTCAGAAGCAACATCACTGGAATAACAGTGGTAGCAAAGGGAAACGCGAAACCAATCACAAAGGATGGTGTGGAGTATTTACAAGCTGAGAAGGCTATAACAAAAATAAGAATTACACACGCACAAATTGTGATTGACGATAATGAGCGACCAGTGGCTGCCAAGAAGCCAGTGTGCCACTACAGCGATCCCAACGTGGCCGAATGTATCAAACGCGTAGCAGAGCAGGCCAAGCAACTTCTGGCTCACGGCATCCCGTCATTCGGCATTCAGCCCTTAGAACCGCTGAAGATACCCAGCATCAGGTTGAGACAACACAACATGCCGCAAGCAAGATTTAAATATGACGCCTGGCTCACTGACCTTACTCTAAACGGATTAACAAACTACACCTTCAATAAATTGGATGTATATCCAGAAGAGATAAAAGTCAACGGGAACATAAGCCTACCAAGACTTGTAATGGGGGGAGAGTATGTTGTCATTGGGGAATTCCAGATGTTGCCAGTGGAATCCACAGGAAAGATGTCCGCAAACTTCACTGACTGTTCTGCTGCTCTCGATGCTGTTGGAGCTAAGGTTCACAAACGAATAGTAATCAAGGATGCCAATGTCAAGTTGCGCTGCACGGGAGCACTCAAGGCCAGCCTGATGGAGGCCCACTCCACTACTAGCGAGATGGAAATGATTACTGACCACATAATTCAAATGCACGCAAGCGAGCTAGCCAAGGAAGTGCAACCGGCCGTAGAGACAGCTCTGGCTATGGTCCTTGAAGATATCGCCAATAAATTCCTCAAGCACATACCCACTAACATGGTGTTCCCTATATAG

Protein sequence:

>DPOGS208744-PA
MKIILLIVTCSVAASAKILPDDFPQCKRNDPELEKCILAAVEVVRPRLLNGIPEVNIPTLEPFNVPTLKLDRTANNLRLKANIKNMKAVGGSKFTIEKFRLNLNNKYVAEIKLSIPKLVVTADYDVKGSRILTLDISGQGRFRSNITGITVVAKGNAKPITKDGVEYLQAEKAITKIRITHAQIVIDDNERPVAAKKPVCHYSDPNVAECIKRVAEQAKQLLAHGIPSFGIQPLEPLKIPSIRLRQHNMPQARFKYDAWLTDLTLNGLTNYTFNKLDVYPEEIKVNGNISLPRLVMGGEYVVIGEFQMLPVESTGKMSANFTDCSAALDAVGAKVHKRIVIKDANVKLRCTGALKASLMEAHSTTSEMEMITDHIIQMHASELAKEVQPAVETALAMVLEDIANKFLKHIPTNMVFPI-