Monarch geneset OGS2.0

DPOGS202636
TranscriptDPOGS202636-TA2283 bp
ProteinDPOGS202636-PA760 aa
Genomic positionDPSCF300371 + 95088-108681
RNAseq coverage282x (Rank: top 39%)
Annotation
HeliconiusHMEL0102122e-16053.23% 
BombyxBGIBMGA008552-TA2e-11551.91% 
Drosophilaboi-PD1e-9033.55% 
EBI UniRef50UniRef50_E0VM311e-9335.76%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VM31_PEDHC
NCBI RefSeqXP_002427175.12e-9435.76%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420129244e-9335.76%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastxgi|2420129247e-9533.88%conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene OntologyGO:00055151.3e-11protein binding
KEGG pathwayoaa:1000766061e-17 
 K06766 (NEO1)maps-> Cell adhesion molecules (CAMs)
InterPro domain[520-634] IPR0089571e-14Fibronectin type III domain
[436-516] IPR0137836e-14Immunoglobulin-like fold
[437-517] IPR0039611.3e-11Fibronectin, type III
[131-214] IPR0035993.2e-07Immunoglobulin subtype
Orthology groupMCL15895 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202636-TA
ATGGACGCGGTAAATCTGTTGTCCGTACATTTTTTGTCCCGTGACTCTCGCCCAGCGGATGCGGCAGAAATCGACATAAGATTCACCAAACATCCCGAGTCAGTATCAGCCCCTGTCGGCGACGAGGTGAGCTTCGAGTGCGCGGTGCGGGTGCCAGGGGAGAGGCTCGCGTGGCGGTGGAGATACGACTCTCCCAACTGGACGGACTGGACAGATCTGAACGGCTTCAACGACAAAGACGGAGTGTCCACACGACTCGTCGTTAAGATTAAAGAAGATACGGCCACTACTTACTATCAGTGCGTCGTTTGGTATGGAGCGATTAGTTTGGTGTCAACTCCCGCTCGTCTTTCGGTCGCTAAGTTGGACCTTAGTAGGAATAGTCCCGAAAAGAGAGTTATAACAGCACCTCTATACAACACAGTCGTAATACATTGTAGGGAGCCGTCATCAGAGCCTCCTGCTAGGCTTAGTTGGTGGAAGGAAACTAAGGGCTCCAGAAGAACTATAGATTCTCCGTACGGAGTACTTGTTATAAAAAACGCGACAGCTGAAGACAGTGGAACTTACGGTTGTACAGCGACTAATGATATCAGTGAACAGGTCGTGGACCTCCCGGAGAGAGTCTACCTTAGGGTGCAGCATGAAGGGAACGGGGGTTACAGGTTTTTGGAGTCTGAAGACTACGCAGGCACCATCCAAGATGGTGTTTTGACAGCTTCAGTTCCATCTGGTGGTATTTTTCGTGTGTGGTGTGACGCTGTTGGCAGTCCACCACCACACGTTCGCTGGACTGGCCCCAATGGTGTCATTGAAGCGAACCACGAGCTGGTGATAGAACGCTTTGATGAGAAACATGAGGGTGTGTATACTTGTTCTGCGAATGACATCCGTCGTTCGTGGAAGGTGATAGCGCTCCGGCCGCCTCAGTGGGAGGGTTCCGCAGCTAGTGTGAACGCTAGCGAGGGTTCCCCGGCCCTCATATCCTGCGGAATCCCACGCGGTCAACCGCCTCCCACCGTACATTGGATACATAACGCTGAACCCATCAATACCGGTCCCGGAGTACTAGCCACAGAGACCTCATTACGTTTGGATCATGTTGAGAAGCGACACGCGGGCGTGGTGCAGTGCGCGGCGTGCGGAGGGGGGAGGTGTGCCCTGGACGCTGCCTTACTCAGCGTTGTACCTGTACAGGTTAACGATCCCGATTATTCGTCTGAAGTGACTAAGACAATGCACGTACCGTCCCAGCAACCGAAGCGACACAACAAAAAGAATCCCAGGAAGCATAAAGCTGTATTAATTCCACCATCTCGTCCGAACGTGTCCCGTCTGTCTGACGAGAGCGTTATGGTATCGTGGTCCCATGACAACCACGGGCTTCCCATACAGTTCTTCAAGGTTCAATATAAAGAGGCGACTAATTCATCGAACGTCCAATGGCAGACAGCTAACCACGACATACCAGCACACATACACTCGTTTGAAATAGACGGACTCATACCAGACAAATATTATAAATTCAGAATAGCGGCCGTATATTCTAATCAGGATAACAAGTTGGGTCGCAGTAGCGGAAGGTTTTTCCTACAGCGAGGATCGTCACAGGCTCCGAGGAGACCCGTCTTGGATAATGCTGTTGCTCTGTCACCACACAGCATACAACTGAATTGGACGATGCCCCCTGACGGCGTAGCACCGGAGGGCTTCTACGTATACTACCGCGCGGTGTCTACGGCGGGCGCCTACGAGAAGGTCATAGCGGGTGGGTCTACGAGGTCTCTGATGCTGGAACACCTCTCGCCGGACACCGCCTACGAGCTCAAGATACAAGCATACATATCGAATGCCCCGAGCGACTTCAGCGCTATATTGGTCGTAGATTTGATTCCAGCCCGTGTTGAACGCCAAGACTCTCCGTGGTCCGGCGCCCCCCTCCCCGGAGCCCGCCCCTCCTCCCGCGAGTCCCCCCGCGCCCCCAGTGCCCTGGTGACGGCGGGAGGGGCGGGTGCCGCTGTCTTACTGCTGCTGCTCGCCGCCCTGGTGCTGATGTGCCGACGACGACGACCGGCGACCAAAGAGAAAGGTTCCGTCCCGGAAGGCTCTAACGGGTATCGCCCCGCCAAAGTTCCTATCAGCATCACAACAAATCCAATGCATACTGAGGGAAGCGAGCCCGGGGTGGAGATGTCGTTCCTTCACAACAACAACTGCGGTAACAATACCGACGGTGACGAAGCTCACTCCAGGAAGAACGGACCCTCGAGGCAGTACGTGTGA

Protein sequence:

>DPOGS202636-PA
MDAVNLLSVHFLSRDSRPADAAEIDIRFTKHPESVSAPVGDEVSFECAVRVPGERLAWRWRYDSPNWTDWTDLNGFNDKDGVSTRLVVKIKEDTATTYYQCVVWYGAISLVSTPARLSVAKLDLSRNSPEKRVITAPLYNTVVIHCREPSSEPPARLSWWKETKGSRRTIDSPYGVLVIKNATAEDSGTYGCTATNDISEQVVDLPERVYLRVQHEGNGGYRFLESEDYAGTIQDGVLTASVPSGGIFRVWCDAVGSPPPHVRWTGPNGVIEANHELVIERFDEKHEGVYTCSANDIRRSWKVIALRPPQWEGSAASVNASEGSPALISCGIPRGQPPPTVHWIHNAEPINTGPGVLATETSLRLDHVEKRHAGVVQCAACGGGRCALDAALLSVVPVQVNDPDYSSEVTKTMHVPSQQPKRHNKKNPRKHKAVLIPPSRPNVSRLSDESVMVSWSHDNHGLPIQFFKVQYKEATNSSNVQWQTANHDIPAHIHSFEIDGLIPDKYYKFRIAAVYSNQDNKLGRSSGRFFLQRGSSQAPRRPVLDNAVALSPHSIQLNWTMPPDGVAPEGFYVYYRAVSTAGAYEKVIAGGSTRSLMLEHLSPDTAYELKIQAYISNAPSDFSAILVVDLIPARVERQDSPWSGAPLPGARPSSRESPRAPSALVTAGGAGAAVLLLLLAALVLMCRRRRPATKEKGSVPEGSNGYRPAKVPISITTNPMHTEGSEPGVEMSFLHNNNCGNNTDGDEAHSRKNGPSRQYV-