Monarch geneset OGS2.0

DPOGS204765
TranscriptDPOGS204765-TA2604 bp
ProteinDPOGS204765-PA867 aa
Genomic positionDPSCF300231 + 79278-83200
RNAseq coverage717x (Rank: top 18%)
Annotation
HeliconiusHMEL0034910.061.25% 
BombyxBGIBMGA002847-TA0.050.69% 
DrosophilaCG31869-PC3e-3850.36% 
EBI UniRef50UniRef50_D6X1E22e-3752.07%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X1E2_TRICA
NCBI RefSeqXP_973923.13e-3852.07%PREDICTED: similar to CG31869 CG31869-PA [Tribolium castaneum]
NCBI nr blastpgi|910914086e-3752.07%PREDICTED: similar to CG31869 CG31869-PA [Tribolium castaneum]
NCBI nr blastxgi|1954333042e-4127.33%GK23980 [Drosophila willistoni]
Group
Gene OntologyGO:00055153.2e-07protein binding
KEGG pathway 
InterPro domain[84-143] IPR0010073.2e-07von Willebrand factor, type C
Orthology groupMCL26574 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204765-TA
ATGGAAAGGAAACGTTATGCGAGTCCTCTACCAAATGTGACGATGGTCGACATTGGACTAAAACAAGGGAGCTGCGCAGTCGGGGACGTGGTGTATATGTCGGGCGACTCGTTCCCGGGATCCAGCGCCTGCGAGCGCTGCGCCTGTTCTGCTGGGGAGGTGTCGTGTGAGAAGCAGCGCTGTGAACCTCGTCCTGGTTGCAAAGCCGTCCATCGCCCCGACCACTGCTGTCCGACCTATCAATGTGAATGCGAACAAGAAGGGCGCATCTACGGGAACGGAGAAAAGCTGGTGGATCCTCACGACCCGTGTCGCGTGTGCTACTGTCAGGGTGGGGAGGTGGTGTGCCGACGCATTGCCTGCTTCCTCCGAGACGACTGCCGTCCGCGTCTCGTTCCTGGCCGTTGCTGTCCCGAATATGACAACTGTCCGCTCAGAGGAGTAACATCTTTACCTGGTATGGCTTCAACGTCAAGTGTTTCGTCTGTGGAGGATGGTGAGAGTAGCATGGCACCAGCTGAACCGGTGGAACCTCCAAAGCCTGTCATCACTATAAAGGAAATCACTCCTGTGTCTGAAATTCCTGTAACAGATGTAAAAATTAAAGAAATTTTACCTTCTCCTGGCATCGAAGAAGTAGAAGTATATACATCTCCGAAATCTCAACTGATAGCTCGAGAGGCGACCTCTGAAAGGAATGTCAATGAAACAGAATCAGACGCAAAACAAGAGTCCTCCGATACAAAAGATGTTTCATTATCGTCTAAGATACCTACTAACTACAATCCAGCGAATTCCGATAATCCTAATGACGGGACGCCTACAAAGATCACTGTGTCAACCGTCGACAGCAATAATGAAGATTTTGCACTTTCTAAAATATCAAATGTAGTGGCAATGATGGGTTCTCCTTCTGAGCCAGACTCTCACATGTTATCTTTAACTACTAAAGCACCTGTAATGGAAGAGGAAGATTTGTCTTTTCTCGATCACAATCCCGCCTTCCCGCCGATACCTGATGATTTATCAGTATTGACTAATCACGACGATGAAATATTACCAGAACAAAACTTGGATATTGAGCATGGAGCCATGGATCATGAGGTAAATAATGTCGCATCCCCTGTTGTAACTGAGGCACCGATCTTCAAAGAAGCTACAACACTAAATCTCATCTCTAGTGCAGCTGTAGTCACGAAAGAGAATTCTCTGGAAACTATTGAAACAAGTACGCATAGATTAATCGATAATACACCATCATCAATCACAAAAGATAATCCTATGTTAAATATGCGATCTGTAATACCAACTGAAATTTTAAACGCCCCTTCTTTAATATCGGATGAGGTAACTGGAGATTTGTTAGATGTTACTGAAAATCCTGCAGTGTTATCTTCTACTAGCGAAAATGTTAATTCAACAGAATATCCTCTCAGTACCTCTGAAGAAAATAAAGAACTTGCAACGCACACAAATATGTCATATGCGGTAGAGAACGGAACATTCTTAAACCCCGAAGAATTTGCCGATAGTTCAAGTAGCAGTGAAGCCGTAACTCCTAAAGTAGACCAAGGGTTCACAGAATCTTTAATGACAAGTAATGGAAAGGGTTCGGACGAGACGACAAATTCTTTATCCAAATCCACTGACATTATTTCCCAAACCGAATTTAGTTCTATTCCTATAGAAACCAGTGACCAGAATCCGCACGAAACAATAGACAAGGAACGTAACGATACGATAGACTCTACTGTGAACTCTTTGGAATTAACGTCATTGCAATCTGTTCCTAACGAGTCTGAGATTGATACCGTATCGAGAAATATACCAATGGAATTAAAAGACCAATCTTTTGAAAATTCAGAAACTACAGAATTTATACTCACGTCTTTTGGATCCCAGGAAATTACTACGGATCCCGTTGAGTTAATAAACCCTGGATCTGATAGTGACAGAAACTCAGCTTTCATCGATCCATCAGGAGGGCGCAAAACGAATGTTCTCACTGATCTGATAAACTTAGTTGGTGACGTGGCTTCCATAAGCGATCACACAGAAGCCTCCGACATCGAGCGACAGACACTGAAACCTACAACTATCTCAGATTCTGAGGAATTGATTCCAGTCAATGTAGCGTCAAGCTACAAGAGCAAAAATAAAAATTGGAATCAGAACTCCATAACTGAGGTGCCGTTCAAAACTAAGGTTTCTAAACAAAAAGTTGTAGAAATTGAGGGAGAAGACGCGGACACCATCACAGACTCTCCCCCGCCATATGATAAAGTCGAACCAACGACTCGCCGCACCCTCATAGATAATGTGTCAGATGACAAAGTTGAAAACAACACCAAAATAACGGGTAAAAAAGATATAGAGATTATAACTCAATCATACGTACCAACCATACAAAGGAGGCCGACGAAGGTTGTCTTGAAGGGCGACGAATCCTCCGCCGAAGAAGACGGAACGACAGCCGATCCTGCGCTTGTTAAAGATAAAGACGAGAACAATACTTCAATAGAGGCCGAGACCTCTACAGAGGTCCACTTCATGGAGTCCGAATCGTCGGAACGGTCTACAGCCGAGCCGAGCGCCGCGCAGTAA

Protein sequence:

>DPOGS204765-PA
MERKRYASPLPNVTMVDIGLKQGSCAVGDVVYMSGDSFPGSSACERCACSAGEVSCEKQRCEPRPGCKAVHRPDHCCPTYQCECEQEGRIYGNGEKLVDPHDPCRVCYCQGGEVVCRRIACFLRDDCRPRLVPGRCCPEYDNCPLRGVTSLPGMASTSSVSSVEDGESSMAPAEPVEPPKPVITIKEITPVSEIPVTDVKIKEILPSPGIEEVEVYTSPKSQLIAREATSERNVNETESDAKQESSDTKDVSLSSKIPTNYNPANSDNPNDGTPTKITVSTVDSNNEDFALSKISNVVAMMGSPSEPDSHMLSLTTKAPVMEEEDLSFLDHNPAFPPIPDDLSVLTNHDDEILPEQNLDIEHGAMDHEVNNVASPVVTEAPIFKEATTLNLISSAAVVTKENSLETIETSTHRLIDNTPSSITKDNPMLNMRSVIPTEILNAPSLISDEVTGDLLDVTENPAVLSSTSENVNSTEYPLSTSEENKELATHTNMSYAVENGTFLNPEEFADSSSSSEAVTPKVDQGFTESLMTSNGKGSDETTNSLSKSTDIISQTEFSSIPIETSDQNPHETIDKERNDTIDSTVNSLELTSLQSVPNESEIDTVSRNIPMELKDQSFENSETTEFILTSFGSQEITTDPVELINPGSDSDRNSAFIDPSGGRKTNVLTDLINLVGDVASISDHTEASDIERQTLKPTTISDSEELIPVNVASSYKSKNKNWNQNSITEVPFKTKVSKQKVVEIEGEDADTITDSPPPYDKVEPTTRRTLIDNVSDDKVENNTKITGKKDIEIITQSYVPTIQRRPTKVVLKGDESSAEEDGTTADPALVKDKDENNTSIEAETSTEVHFMESESSERSTAEPSAAQ-