Monarch geneset OGS2.0

DPOGS204574
Transcript         DPOGS204574-TA   2508 bp
Protein            DPOGS204574-PA   835 aa
Genomic position   DPSCF300376 - 34684-43068
RNAseq coverage    156x (Rank: top 52%)
Annotation
Heliconius       HMEL005609         0.0      78.89%
Bombyx           BGIBMGA001979-TA   0.0      61.04%
Drosophila       CG6867-PA          8e-164   41.81%
EBI UniRef50     UniRef50_B4M6R9    2e-165   42.44%   GJ16585 n=7 Tax=Neoptera RepID=B4M6R9_DROVI
NCBI RefSeq      XP_001843332.1     0.0      48.49%   colmedin [Culex quinquefasciatus]
NCBI nr blastp   gi|170030914       0.0      48.49%   colmedin [Culex quinquefasciatus]
NCBI nr blastx   gi|158294873       0.0      48.85%   AGAP005849-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology    GO:0005515     1.3e-60   protein binding
KEGG pathway     seu:SEQ_1649   3e-23
                 K13734 (sfb1) maps-> Bacterial invasion of epithelial cells
InterPro domain  [581-830]   IPR003112   1.3e-60   Olfactomedin-like
                 [325-419]   IPR013783   8.4e-17   Immunoglobulin-like fold
                 [423-506]   IPR013098   1.3e-09   Immunoglobulin I-set
                 [169-226]   IPR008160   2.5e-09   Collagen triple helix repeat
                 [332-416]   IPR003599   3.6e-09   Immunoglobulin subtype
                 [338-404]   IPR003598   5.6e-07   Immunoglobulin subtype 2
Orthology group  MCL12900    Insect specific
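
The InterPro rows above are listed by e-value, not by position along the protein. As a minimal sketch, the Python snippet below re-orders them by start coordinate and checks that each range fits inside the 835 aa protein. The coordinates are assumed to be 1-based and inclusive, which the record does not state; the function name `domains_by_position` is hypothetical.

```python
# InterPro rows copied from the record: (start, end, InterPro ID, e-value, name).
# Assumption: coordinates are 1-based and inclusive amino-acid positions.
PROTEIN_LENGTH = 835  # aa, from the Protein row of the record

domains = [
    (581, 830, "IPR003112", 1.3e-60, "Olfactomedin-like"),
    (325, 419, "IPR013783", 8.4e-17, "Immunoglobulin-like fold"),
    (423, 506, "IPR013098", 1.3e-09, "Immunoglobulin I-set"),
    (169, 226, "IPR008160", 2.5e-09, "Collagen triple helix repeat"),
    (332, 416, "IPR003599", 3.6e-09, "Immunoglobulin subtype"),
    (338, 404, "IPR003598", 5.6e-07, "Immunoglobulin subtype 2"),
]


def domains_by_position(rows, protein_length):
    """Return the rows sorted by start coordinate after a range check."""
    for start, end, ipr, _evalue, _name in rows:
        if not (1 <= start <= end <= protein_length):
            raise ValueError(f"{ipr}: range {start}-{end} outside 1-{protein_length}")
    return sorted(rows, key=lambda row: row[0])


for start, end, ipr, evalue, name in domains_by_position(domains, PROTEIN_LENGTH):
    # Domain length under the inclusive-coordinate assumption.
    print(f"{start:>4}-{end:<4} {end - start + 1:>4} aa  {ipr}  {evalue:.1e}  {name}")
```

Read in this order, the hits run from the collagen triple helix repeat, through the immunoglobulin-type hits, to the C-terminal olfactomedin-like domain. The three immunoglobulin hits at 325-419, 332-416 and 338-404 overlap one another.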
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204574-TA
ATGACTTCGGACTTAATCAAGGATAATAAGAATAGAGGAGAAGTTGCTGCTACAGATGTGCCGTGCTGCAAGAAAATTACTGTGTTTGCCTGTGTCTCTGGCGTTTTTTCATTAATTTTGCATGCATATAGCTATGCAGAACTCTCGGCTATTAAAAGTCATGAAGAGTTACATTCTAGACACATAAATAAACTCATAGAAAATAGGATTCAAGAAAGATTTATAGAGTTAATGAGTACAACTAGCCCCCATAGATTAAAAAGAGATGCGATGCTTAAACAATCCCCCATCGAGGAGGATAATACGGTTGCTCCACACGTGGAATTTTTCAACCCTAAAATGAGACCAGAACTAGAAGAGAAAGACTCCATAGAAATGAAAAGAACCGGTGCCAAGGGACCTGCCCCGGGAGACGACACTTGGGTTTGGCTGACGAGCTACTCCAGGGTCCCATACAAAGTAGTTCAGGGGTTTTGCAAGGCTACTCAGGATTACTGTCCTCCTGGCGTCCAAGGACCAAAAGGTCCCATGGGTCACCCAGGTCCAAAAGGAGACAGGGGGTCACCAGGGGAGGCTGGCATACCTGGTAGCCCAGGTTCAGTAGGACCTTTCGGACCCCCTGGACCAAAAGGCGAACGTGGATTTCCGGGGAATCCTGGCTTAGATGGTAGAGATGGAGTGCCAGGAGAACCAGGACTTGATGGCTTGCCGGGGCGGAATGGGGCAGACGGAGCCCCGGGTAGGTACGGACAAGACGGGATACCAGGCAGGGATGGAATCCCAGGAAAAAATGGAAAGGATGGAAAAGATGGAAAAGTCGGAGCTCAGGGCCCACCTGGTATTCGAGGCCCTAAAGGCGAACGAGGTCCAATCGGCCCCAAAGGCCCGAAGGGAAATGACGGACTTAACGGAATACCCGGCAAGCCAGGACTATCCATCTATAACTACACCAAAGAAAACCAGATGTTCATTCCCCCTTCCTTTGCATTGGATAATCCGAGACTTATAGTAAGAGAGGGGGATACTATGAGATTGGACTGCAATCCCAAAGGCTTCCCTGAACCCATCATTGAATGGAGGAGAGCTGACGGCACACCCATTATTCAGGGTTCATGGCGTGACGCCTCCGTCAGTGGTCACGTGCTTAACATACCAAACGTATCTCGTTGGCACACCGGCAAGTATGTGTGTCTCGCTAACAACGGCATGCAGCCTCCCGCCAACCAGACCACGGATGTTGAAGTTAATTTCAGCCCATACATCAGGGTGCCAAACAACATAGTCTACGTATTCAACAAAACTGCCCAAATCGAGTGCGAGATTCAAGCCTGGCCGGAGCCAGTGCTGGCTTGGGAGTACGACGATGGAACAACAGTCGAGGGATCACACTACAAGATTGAGGTGGCGCCAACACCGGATCCCTGGAGGTGGATCATGAAGCTGGAGATACCTCACATCAATGAGCACGACATGCGCCAGTACATCTGCGTGGCCAAAAATGAACTCAATAACACAACCGTCAGAGGCTATATTAGACTGTCCCATCCTGGTCCGAAACAACAATCTCAGATACAACAACAACCACGCGAGTTCGGCTCCCCGCCGCCCACGTTGACCTCGTACGAAGAACTGTGCTCCGCCCAACGCTGCCCATCCTGCCCACGATGTGATCGAGCGCTCATGATCACGCCCATGAACGCCAGCTTAGGCAACAAGCCTCACCGGAATACCAATTGTCAGCTGTACGCGATCGGCAAACCAGTGTACCACAAGTACAAGGAGGAGTTGTTTGGTGCCTGGCTAAGAGATTCGAATTCCTCTGAAGCTCAGCGAGAGAAGCTGTGGACTACCCAGGAGAACGACGTGGAGAGATTGCACGAATTCCGGAGTAAGGCAAGCTTCAAGTCGGATAGAGTAGACGAGTTCCACAAACTCCAGAAACCTTTCTTTGGTAATGGTCACATAGTGTACAGCGGCTCTTTCTTCTATCAAGCCAACGA
GTCCGGTACACCCGGCGACATTGTGCGCTACGACCTGACACAAAGCCGTATCAAATCAGCACATCTACCGCACGCGCAGGGCAGACTGTACACGGCACAACACAACCAAGTCGACTTCAGCGCCGACGACAACGGCCTCTGGGCGATTTACTCCATAGAAGGTTCGAATAACACAGCAGTTGCTAAGCTGAGCTTTGATCCCAACAAGGATGATCTTAATATAGACTATATCTGGAACATCTCCTTAAATCATAAACAAGTAGGTGAAATGTTCATAGTTTGCGGCGTCCTCTATGCGTTGGATTCCGCAACAGAACGCGACAGCAAAGTATCGATCGCCATTGACTTGTACCTTAGCAAGTCGATCGATGTCACACTGCAGTTCACGAATCCATTCAGAAAAACAACACAATTAGGCTACGATCACACGCATAAGGAACTATATTCCTGGGATAGGGGTAATCAGCTGACATATCCAGTCCGGTACAACGAACTTCCGGGCCCCTAA

Protein sequence:

>DPOGS204574-PA
MTSDLIKDNKNRGEVAATDVPCCKKITVFACVSGVFSLILHAYSYAELSAIKSHEELHSRHINKLIENRIQERFIELMSTTSPHRLKRDAMLKQSPIEEDNTVAPHVEFFNPKMRPELEEKDSIEMKRTGAKGPAPGDDTWVWLTSYSRVPYKVVQGFCKATQDYCPPGVQGPKGPMGHPGPKGDRGSPGEAGIPGSPGSVGPFGPPGPKGERGFPGNPGLDGRDGVPGEPGLDGLPGRNGADGAPGRYGQDGIPGRDGIPGKNGKDGKDGKVGAQGPPGIRGPKGERGPIGPKGPKGNDGLNGIPGKPGLSIYNYTKENQMFIPPSFALDNPRLIVREGDTMRLDCNPKGFPEPIIEWRRADGTPIIQGSWRDASVSGHVLNIPNVSRWHTGKYVCLANNGMQPPANQTTDVEVNFSPYIRVPNNIVYVFNKTAQIECEIQAWPEPVLAWEYDDGTTVEGSHYKIEVAPTPDPWRWIMKLEIPHINEHDMRQYICVAKNELNNTTVRGYIRLSHPGPKQQSQIQQQPREFGSPPPTLTSYEELCSAQRCPSCPRCDRALMITPMNASLGNKPHRNTNCQLYAIGKPVYHKYKEELFGAWLRDSNSSEAQREKLWTTQENDVERLHEFRSKASFKSDRVDEFHKLQKPFFGNGHIVYSGSFFYQANESGTPGDIVRYDLTQSRIKSAHLPHAQGRLYTAQHNQVDFSADDNGLWAIYSIEGSNNTAVAKLSFDPNKDDLNIDYIWNISLNHKQVGEMFIVCGVLYALDSATERDSKVSIAIDLYLSKSIDVTLQFTNPFRKTTQLGYDHTHKELYSWDRGNQLTYPVRYNELPGP-
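
The stated lengths of the two sequences can be cross-checked with simple arithmetic: a 2508 bp coding sequence holds 836 codons, and the last one is a stop codon (the nucleotide sequence ends in TAA, shown as the trailing "-" in the protein), which leaves 835 residues. The sketch below assumes the transcript is entirely coding sequence with no UTRs, which the record does not state explicitly; `expected_protein_length` is a hypothetical helper name.

```python
def expected_protein_length(transcript_bp, has_stop_codon=True):
    """Protein length implied by a coding sequence of transcript_bp bases.

    Assumption: the whole transcript is coding sequence (no UTRs).
    """
    if transcript_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    codons = transcript_bp // 3
    # The terminal stop codon does not contribute a residue.
    return codons - 1 if has_stop_codon else codons


print(expected_protein_length(2508))  # prints 835, matching the Protein row
```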