Monarch geneset OGS2.0

DPOGS207359
Transcript: DPOGS207359-TA | 1398 bp
Protein: DPOGS207359-PA | 465 aa
Genomic position: DPSCF300188 + 384853-387292
RNAseq coverage: 414x (Rank: top 29%)
Annotation
Heliconius: HMEL008861 | 0.0 | 78.06%
Bombyx: BGIBMGA008614-TA | 1e-117 | 73.61%
Drosophila: CG11807-PA | 4e-92 | 38.46%
EBI UniRef50: UniRef50_E3XFJ8 | 1e-113 | 44.21% | Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3XFJ8_ANODA
NCBI RefSeq: XP_394619.1 | 7e-116 | 48.53% | PREDICTED: similar to CG11807-PA [Apis mellifera]
NCBI nr blastp: gi|340729382 | 1e-114 | 48.53% | PREDICTED: nischarin-like [Bombus terrestris]
NCBI nr blastx: gi|340729382 | 6e-114 | 48.53% | PREDICTED: nischarin-like [Bombus terrestris]
Group
Gene Ontology:
  GO:0005515 | 1.8e-21 | protein binding
  GO:0007154 | 1.8e-21 | cell communication
  GO:0035091 | 1.8e-21 | phosphatidylinositol binding
KEGG pathway: ana:alr0124 | 4e-09
  K13730 (inlA) maps -> Bacterial invasion of epithelial cells
InterPro domain: [6-117] | IPR001683 | 1.8e-21 | Phox homologous domain
Orthology group: MCL13826 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207359-TA
ATGTCGTCTTTCATAAATTATCGTAATGAAAGTTCCGTAAATGTTTCTGATGCGTGCTTGAAAGATAATATAACTTTTTATAAAATCGTCGTGAATGTGGGTACAGTGCAGTGGAGCGTATCTCATCGGTACAGTGATTTTGTAGAGCTTCACGATAAATTAGTGGTCGACCATGGCGTTGCTAAAGACTTACTGCCACCGAAGAAAGTCATACGTAACAAGACGCCTAAATTTGTGGAACAGAGGCGTGAAGCGTTAAACGATTACCTGAAAAATGTATTTAATTATCTAAGGCTAACAATGCCATCAATATTTAGCCACTTTCTAGATTTCCATTTGTATGATATATTTTTCTTATTACAAGATTTGGCTAAGAAGTTATATTTAGAAGGCGATAAGTTATTACAAACAGAAAAAACTCATAAATTCAGTCTAATGGAGTTGCACGCAATCAGAGAGAGATATAAAATGGCCTGTCCACCAACAGAAAAGCAAGATCAGATGTACGACTTCAGCCACATTCTGGACTTCTGCTCTCAAGTGTCATCACTAAGTATCGAAGGGAGCTATGAAAAATTGGGCACAAGCAATATCATACCCAACGACTTACAAATAGATCTAATTCCCTTCAAAACATTGACCGACCTTAGGATCCTCGGTGTGCCAATGGCATGTATACAAAGTGTCGGTAGTCTCCGAGATTCCTTGATCAGTTTATCTGTACATATAGCAAGTGTGGAGACATTAAATGAGTTCCTGCTTGTGGATGTCCTCCACAAAGATCCTTCTAGTCTTGCCGATGTTGTTATATGGAAGAAGTTGTCAACAATAAATTTCGCTACAAACAACATAACCGATGTGGACTGGGGCATCAAACTAGTTCCAAAACTACAAAAACTGAACGTTTCTTCAAACAGATTATCCGAGCTATGTGATATATCTTGCCTGCACGAGTTGAGAGTCTTAAATCTTTCCATGAATCGTTTTTCATTGTGCCAGAACTGGCATGCTAAAATCGGCAACATAGTTAAGATAGATCTCTCACAAAATAAAATTGAAACTCTGCAAGGCTTCTCAAAACTATATTCACTCGAAAGCTTAGATTTGAGCTGCAATGTCATCACCGACGTGGAAGAGGTGCAGCATATTTGTAATCTGCCGTGTTTAGAATATCTATGGCTGACAGCGAACCCTGTCGCGTCAACCATAGACTATAGAGTGAAAGTCATAGAACAATTCAACTCGCGCATGACAGAGATATGTTTGGACAACGAAAAAGCGTCCGAAAAAGAACTGGACACATCAAGAGTTTTGCAAGCTCTGAGGATCGTAAAGGAAGGTAAAACGCCAAGTTTCCTACAGAACACCAATTGCAGTCACAGTTACAATAAAAGTTGA

Protein sequence:

>DPOGS207359-PA
MSSFINYRNESSVNVSDACLKDNITFYKIVVNVGTVQWSVSHRYSDFVELHDKLVVDHGVAKDLLPPKKVIRNKTPKFVEQRREALNDYLKNVFNYLRLTMPSIFSHFLDFHLYDIFFLLQDLAKKLYLEGDKLLQTEKTHKFSLMELHAIRERYKMACPPTEKQDQMYDFSHILDFCSQVSSLSIEGSYEKLGTSNIIPNDLQIDLIPFKTLTDLRILGVPMACIQSVGSLRDSLISLSVHIASVETLNEFLLVDVLHKDPSSLADVVIWKKLSTINFATNNITDVDWGIKLVPKLQKLNVSSNRLSELCDISCLHELRVLNLSMNRFSLCQNWHAKIGNIVKIDLSQNKIETLQGFSKLYSLESLDLSCNVITDVEEVQHICNLPCLEYLWLTANPVASTIDYRVKVIEQFNSRMTEICLDNEKASEKELDTSRVLQALRIVKEGKTPSFLQNTNCSHSYNKS-
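
The transcript and protein lengths reported above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record: it assumes the standard genetic code, assumes the trailing "-" in the protein sequence marks the stop codon, and uses hypothetical helper names (`translate`, `lengths_consistent`). It translates only the first 30 and last 9 nucleotides of DPOGS207359-TA rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the record uses the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any incomplete trailing codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 30 and last 9 nucleotides of DPOGS207359-TA, copied from the record.
cds_start = "ATGTCGTCTTTCATAAATTATCGTAATGAA"
cds_end = "AAAAGTTGA"

print(translate(cds_start))           # MSSFINYRNE
print(translate(cds_end))             # KS*
print(lengths_consistent(1398, 465))  # True
```

The translated fragments agree with the start (MSSFINYRNE) and end (KS followed by the stop) of DPOGS207359-PA, and 1398 bp corresponds to 465 codons plus one stop codon.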