Monarch geneset OGS2.0

DPOGS205038
TranscriptDPOGS205038-TA2721 bp
ProteinDPOGS205038-PA906 aa
Genomic positionDPSCF300388 + 5800-14414
RNAseq coverage12x (Rank: top 83%)
Annotation
Heliconius% 
BombyxBGIBMGA001810-TA1e-8757.98% 
DrosophilaCG42668-PM8e-8368.37% 
EBI UniRef50UniRef50_E3WU982e-8165.91%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WU98_ANODA
NCBI RefSeqXP_002054255.13e-8264.41%GJ24349 [Drosophila virilis]
NCBI nr blastpgi|3123808536e-8165.91%hypothetical protein AND_06962 [Anopheles darlingi]
NCBI nr blastxgi|3123808532e-8055.40%hypothetical protein AND_06962 [Anopheles darlingi]
Group
Gene OntologyGO:00055152.4e-25protein binding
KEGG pathway 
InterPro domain[51-433] IPR0006488e-98Oxysterol-binding protein
[84-209] IPR0119932.4e-25Pleckstrin homology-type
[88-208] IPR0018495.4e-15Pleckstrin homology domain
Orthology groupMCL44331 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205038-TA
ATGAGTTCCTCTCCCGCCATAGAGACTTCGACCCCCACAGCCACACCCTCGTCACCAGCTCTGAGGTCCGCGGAATACCAGTTCCATACGTCAACACCGAATTATCTGAAGTCATCAGAAACAGCAGTGGGGGGTGTATCGCGGACCCCAAGCACAGCTCTCACACGAAAAGAGAGTTACAAAGCACAGAGGCAACACTACAGGAAGGAGAAGAAGAGGGCCGCCTCTGCGCTGTTACACTCTATAGAGGATCCGTCTGTTATAGTGCTCGCTGACTGGCTGAAGGTCCGAGGGTCTCTCAAGTCGTGGACGAAGCTATGGTGTGTGCTGAAGCCAGGGTTGCTGCTTCTCTACAAGAGTCCTAAGGCGAAACGCAGTCACTGGGTGGGTACGGTGTTGCTCACGTCGTGCCAGGTCATAGAGCGGCCCAGCAAGAAGGACGGCTTCTGCTTCAAACTCTACCACCCCCTGGAACAGAGCATCTGGGCGCCCAGGGGTCCGCACAACGAGACTATAGGCGCGGTGGTCCAGCCCCTGCCGACCGCTCACCTGATCTTCCGCGCCCCCTCCCCCGCCGCCGGACACTGCTGGCTGGATGGACTGGAACTAGCGCTGAGATGTAGCAATGCTATGCTAAGATGTTCTCGCTCTCGGCCGGAACAAGCTGTAGAGGACACGCCGGCGCAGACCAACATCTCGCACGAGGAGCTCGAGAAACACTTCAATGAGCACGTATTAAAGCTAGCCGAGTCCACAACCGGCTCGCCGTCACCGCTCCTATCCATAGCCGACTACTCACCCAGCAGCATCCGGTCCGAGAGGAAGAAAACTAAACACAACATCGAATACGACAAAACTAAGAGTTTGACCGCCAGCCCGTTCAGGATAGACTCGAACAGGATACTGAAGTGTTTCGAGCTGCCGAAACTGATGGCCGATAAGAGGGAGCTGTCCTCGCAGCTGGAGGAGGGCATCAAGTTGGTGTTCAAGCGGAGGTCCTCAGCGCCATCTATGTCCAAGAACAGTCTGAAAGTCAAAGCTTTGAGGGAATCCGGGTCCAAGACGGAGTCTGACGAGGAGGAGCAATCGAAGGTGTTGTTCAGCTCGCCGCAAAACTTTCTGATAACTAAAACGGCGCCATCTATCGAGCGTTTCGATCAAAGCGACGGCACCACCACCAGCGATGACGGCAGCGGAAATCCGATAGACAGAAAATATTCGCTAATAGACCACAATAACCTGCCGTCGGCCAGCTACGAGGCGGTGATGATGATGAGGAGGGACGAGAGGATGAGTGATCACAGCGTTTATCTCGCTGGGAAGATAGTCGGAGGAGGCGAACCTACCCTCACCCCACCGCCGCGATCTCCGGGTGCTCCGCTAGTGCCCCCTGCCCCGGGGTGTGGTGACCGAGCTCACCCCCCGGACGGGACCTCCTCAGACAAGTCATCAGAAACAGCAGTGGGGGGTGTATCGCGGACCCCAAGCACAGCTCTCACACGAAAAGAGAGTTACAAAGCACAGAGGCAACACTACAGGAAGGAGAAGAAGAGGGCCGCCTCTGCGCTGTTACACTCTATAGAGGATCCGTCTGTTATAGTGCTCGCTGACTGGCTGAAGGTCCGAGGGTCTCTCAAGTCGTGGACGAAGTTATGGTGTGTGCTGAAGCCAGGGTTGCTGCTTCTCTACAAGAGTCCTAAGGCGAAACGCAGTCACTGGGTGGGTACGGTGTTGCTCACGTCGTGCCAGGTCATAGAGCGGCCCAGCAAGAAGGACGGCTTCTGCTTCAAACTCTACCACCCCCTGGAACAGAGCATCTGGGCGCCCAGGGGTCCGCACAACGAGACTATAGGCGCGGTGGTCCAGCCCCTGCCGACCGCTCACCTGATCTTCCGCGCCCCCTCCCCCGCCGCCGGACACTGCTGGCTGGATGGACTGGAACTAGCGCTGAGATGTAGCAATGCTATGCTAAGATGTTCTCGCTCTCGGCCGGAACAAGCTGTAGAGGACACGCCGGCGCAGACCAACATCTCGCACGAGGAGCTCGAGAAACACTTCAATGAGCACGTATTAAAGCTAGCCGAGTCCACAACCGGCTCGCCGTCACCGCTCCTATCCATAGCCGACTACTCACCCAGCAGCATCCGGTCCGAGAGGAAGAAAACTAAACACAACATCGAATACGACAAAACTAAGAGTTTGACCGCCAGCCCGTTCAGGATAGACTCGAACAGGATACTGAAGTGTTTCGAGCTGCCGAAACTGATGGCCGATAAGAGGGAGCTGTCCTCGCAGCTGGAGGAGGGCATCAAGTTGGTGTTCAAGCGGAGGTCCTCAGCGCCATCTATGTCCAAGAACAGTCTGAAAGTCAAAGCTTTGAGGGAATCCGGGTCCAAGACGGAGTCTGACGAGGAGGAGCAATCGAAGGTGTTGTTCAGCTCGCCGCAAAACTTTCTGATAACTAAGACGGCGCCATCTATCGAGCGTTTCGATCAAAGCGACGGCACCACCACCAGCGATGACGGCAGCGGAAATCCGATAGACAGAAAATATTCGCTAATAGACCACAATAACCTGCCGTCGGCCAGCTACGAGGCGGTGATGATGATGAGGAGGGACGAGAGGATGAGTGATCACAGCGTTTATCTCGCTGGGAAGATAGTCGGAGGAGGTAATGCACGGGGGCTGGTTGTGAGCAACCTGCCTATAACGGTTGCATAA

Protein sequence:

>DPOGS205038-PA
MSSSPAIETSTPTATPSSPALRSAEYQFHTSTPNYLKSSETAVGGVSRTPSTALTRKESYKAQRQHYRKEKKRAASALLHSIEDPSVIVLADWLKVRGSLKSWTKLWCVLKPGLLLLYKSPKAKRSHWVGTVLLTSCQVIERPSKKDGFCFKLYHPLEQSIWAPRGPHNETIGAVVQPLPTAHLIFRAPSPAAGHCWLDGLELALRCSNAMLRCSRSRPEQAVEDTPAQTNISHEELEKHFNEHVLKLAESTTGSPSPLLSIADYSPSSIRSERKKTKHNIEYDKTKSLTASPFRIDSNRILKCFELPKLMADKRELSSQLEEGIKLVFKRRSSAPSMSKNSLKVKALRESGSKTESDEEEQSKVLFSSPQNFLITKTAPSIERFDQSDGTTTSDDGSGNPIDRKYSLIDHNNLPSASYEAVMMMRRDERMSDHSVYLAGKIVGGGEPTLTPPPRSPGAPLVPPAPGCGDRAHPPDGTSSDKSSETAVGGVSRTPSTALTRKESYKAQRQHYRKEKKRAASALLHSIEDPSVIVLADWLKVRGSLKSWTKLWCVLKPGLLLLYKSPKAKRSHWVGTVLLTSCQVIERPSKKDGFCFKLYHPLEQSIWAPRGPHNETIGAVVQPLPTAHLIFRAPSPAAGHCWLDGLELALRCSNAMLRCSRSRPEQAVEDTPAQTNISHEELEKHFNEHVLKLAESTTGSPSPLLSIADYSPSSIRSERKKTKHNIEYDKTKSLTASPFRIDSNRILKCFELPKLMADKRELSSQLEEGIKLVFKRRSSAPSMSKNSLKVKALRESGSKTESDEEEQSKVLFSSPQNFLITKTAPSIERFDQSDGTTTSDDGSGNPIDRKYSLIDHNNLPSASYEAVMMMRRDERMSDHSVYLAGKIVGGGNARGLVVSNLPITVA-