Monarch geneset OGS2.0

DPOGS214388
TranscriptDPOGS214388-TA2238 bp
ProteinDPOGS214388-PA745 aa
Genomic positionDPSCF300020 + 1104586-1125307
RNAseq coverage823x (Rank: top 16%)
Annotation
HeliconiusHMEL0206453e-13964.71% 
BombyxBGIBMGA004014-TA5e-10476.59% 
Drosophilacert-PA5e-7140.80% 
EBI UniRef50UniRef50_UPI00022468AE1e-7743.16%UPI00022468AE related cluster n=1 Tax=unknown RepID=UPI00022468AE
NCBI RefSeqXP_392830.29e-8045.78%PREDICTED: similar to CG7207-PA isoform 1 [Apis mellifera]
NCBI nr blastpgi|3072053395e-8246.31%Collagen type IV alpha-3-binding protein [Harpegnathos saltator]
NCBI nr blastxgi|3072053392e-8146.31%Collagen type IV alpha-3-binding protein [Harpegnathos saltator]
Group
Gene OntologyGO:00055151e-18protein binding
KEGG pathway 
InterPro domain[560-737] IPR0233934.3e-32START-like domain
[557-738] IPR0029131e-21Lipid-binding START
[15-113] IPR0119931e-18Pleckstrin homology-type
[20-115] IPR0018496.2e-17Pleckstrin homology domain
Orthology groupMCL12443 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214388-TA
ATGACTGATGAAAATATAACTATGAGTGATAATTCGGACTTGGAAGATGGTAGTCCCCCAGAACTTAGAGGATATTTAAGTAAATGGACTAACTACATACATGGATGGCAAGATAGATTTATTGTTTTAAAGGAATCTACATTATCGTATTACAAAAGCGAATTGGAGAGTAATCTGGGTTGTCGTGGAGCACTATGCTTAAAAAAAGCTAAAGTCAAGCCACATGAATTTGACGACTGCAGATTTGATGTGTCCGTTAATGATTGCGTCTGGTACTTGAGAGCTTCCAACCCTGAAGAGAAGCAGCAGTGGATTGATGTACTGGAGTCATTCAAAGTGGAATCGGGATATGGTACGAGTTCGGACAGCAGTACAAATGGCAGCGGCCTTCGGAGACATGGTTCAGTTACATCGCTCCAGTCAACTGGATCTGCTGGTCGTACGAATCGCCGCCTGGCTGAGAAGATCGCGGAGTTTGACACATACACCGACCTGTTGTCCAAACAAGCTGTCACCCTCCAGGGTTACTTCGACATGGCAGCTGCCCAAATAAACGATGACGAGGCTATTGACGCTGTTAAATATGCTGATGGAATAAACCTCAGTCAATATTCACAAGAGGTGCGGGCTACGAGTGCATCTTTTCGTGCAACGTGTGCGGCCGCGGTATTGGCGGCTCAGACATGCGCTGACCTCCTGCGACGAAGAGAGACACGAGCGAGGCAGGCGGAGGAACTGTACATCAAGCCATTGATAGAGCAGTACTTCGGTGGAACTCTGGATGTAGAGTTAGTGTGCAGTGAGGCAGACGAGCCACCAACTCGGTCCACAGAGACCTTCCTGCAGCTCTCCTGCTTCATATCACAGGACGTCAAGTATCTACAGTCCGGACTCAGATCTGAAGGGCCACACTCGACGTTACCAGACGACGAGTTCTACGATGCTGTTGAGACAGGCTTCGATAAGATGGAGGAGGAGCGGTGTTCGAGGGTGGCGCCCCCAGCAGAACTCACCAGGGATGAGGTTGAACTGCCACCGCCGGCTGTTGATAACAGGCTCACGGTGCATACATTATGGCCAGAGATCGACAGGATATCCACAGAGCAAATACAGGCAGCGTTCGAGGGTGTCGGAGGTCAGATAGGATGGCAATTGTTCGCTGAGGAAGGTGACATGAGGATGTATAGGAGAGTGGAGGGATGGAGGGTTGAGGCAGGAGTGGAGACGCTCTCGGACCACCGCTACATACGGTTCGAAGTGTCTTCCACTCCTGCAGGTCGCCGGAGTCCAGCGTCGAGTTCCTCGTCACCCCGGGAGAGAAGTCGGTTCCCGCGTTGGGCACTGTCAAAGCTCAACCGGGAACTGGCTGAAGAAGCGGCCGTTGTCGGCCGCTGGAGTCTCCCGGAGAGTGCGGAGTTGGGGGTGGATGAAGGGGCTAGTCGCTTCGGAGACGTTCTCCAAAACGTCTGCAGAGCGGCGATGCCCCCCGTAGGACGTCCCCCCCCGCGGGGAGCGGTGTACTGGTGGTCGGACAACATCTCCGACCTCCGGGTCGCCTGCAACGGGGCCAGGAGGGCATATACCCGGAGTAGGCGACGCCGCCCCCAGGACGAGGAGCGAGATGGCCGGCTGTACAGGATCTACGTCGCGAAAAAGTTGATCCTGCAGCAGGCCATCTGCCGGGAAGTGGAAGTTGATGGTATGGTGATGGATCCATTGAAGGCGATGCACAAGGTACGAGGAGTATCAGCTCGGGAGATGTGCCATTACTTCTTCAATCCGAGATACAGATACGAGTGGGAGACTACACTAGAGAATATGAATATAGTTGAGGCGATCTCCAGCGACGCGATAGTGTTCCACCAGACGTTCAAGCGCATCTGGCCGGCGTCTCAGAGGGACGCGCTGTTCTGGTCTCACGTCCGGGCCGCGCCGCAGCACACGTACGCCGTCACCAACCACTCCACCACCAACGAACGGTACCCGGCAAACTCCGGAGCTTGTATACGTCTCATAGTGACTGTGTGTCTGGCGTGCCGCAGCGAGTGGCCTCCCGGGCAGCAACCCTCCAGGGATAACATTACCACTAGCATCGCGTATTGCAGTACAGTGAACCCCGGAGGCTGGGCGCCGGCGGGCGTGCTGCGGGCGGTCTACAAACGGGAGTACCCGAAGTTCCTTAAACGGTTCACAGGCTACGTGCTCGAGCAGTGCCGGGACAAACCGCTAGTCATGTAG

Protein sequence:

>DPOGS214388-PA
MTDENITMSDNSDLEDGSPPELRGYLSKWTNYIHGWQDRFIVLKESTLSYYKSELESNLGCRGALCLKKAKVKPHEFDDCRFDVSVNDCVWYLRASNPEEKQQWIDVLESFKVESGYGTSSDSSTNGSGLRRHGSVTSLQSTGSAGRTNRRLAEKIAEFDTYTDLLSKQAVTLQGYFDMAAAQINDDEAIDAVKYADGINLSQYSQEVRATSASFRATCAAAVLAAQTCADLLRRRETRARQAEELYIKPLIEQYFGGTLDVELVCSEADEPPTRSTETFLQLSCFISQDVKYLQSGLRSEGPHSTLPDDEFYDAVETGFDKMEEERCSRVAPPAELTRDEVELPPPAVDNRLTVHTLWPEIDRISTEQIQAAFEGVGGQIGWQLFAEEGDMRMYRRVEGWRVEAGVETLSDHRYIRFEVSSTPAGRRSPASSSSSPRERSRFPRWALSKLNRELAEEAAVVGRWSLPESAELGVDEGASRFGDVLQNVCRAAMPPVGRPPPRGAVYWWSDNISDLRVACNGARRAYTRSRRRRPQDEERDGRLYRIYVAKKLILQQAICREVEVDGMVMDPLKAMHKVRGVSAREMCHYFFNPRYRYEWETTLENMNIVEAISSDAIVFHQTFKRIWPASQRDALFWSHVRAAPQHTYAVTNHSTTNERYPANSGACIRLIVTVCLACRSEWPPGQQPSRDNITTSIAYCSTVNPGGWAPAGVLRAVYKREYPKFLKRFTGYVLEQCRDKPLVM-