Monarch geneset OGS2.0

DPOGS212884
TranscriptDPOGS212884-TA3669 bp
ProteinDPOGS212884-PA1222 aa
Genomic positionDPSCF300333 - 55732-130026
RNAseq coverage185x (Rank: top 49%)
Annotation
HeliconiusHMEL0075536e-15777.43% 
BombyxBGIBMGA004844-TA1e-13369.95% 
DrosophilaCG42843-PB3e-4635.76% 
EBI UniRef50UniRef50_D6X1A85e-5424.52%Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6X1A8_TRICA
NCBI RefSeqXP_001812113.15e-6224.62%PREDICTED: similar to CG31714 CG31714-PB [Tribolium castaneum]
NCBI nr blastpgi|3454857273e-10528.93%PREDICTED: hypothetical protein LOC100123035 [Nasonia vitripennis]
NCBI nr blastxgi|3454857272e-10328.76%PREDICTED: hypothetical protein LOC100123035 [Nasonia vitripennis]
Group
KEGG pathwaydre:307182e-12 
 K02599 (NOTCH)maps-> Dorso-ventral axis formation
    Notch signaling pathway
InterPro domain[446-519] IPR0137831.2e-10Immunoglobulin-like fold
[445-534] IPR0130983e-07Immunoglobulin I-set
Orthology groupMCL16382 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212884-TA
ATGTATTACGATGAATCGTATAAATTGGGTGAACGGAAAGCACTCACCGTTACGGAGGAGAACATCACTAGGAATAACTGGTTACAGGTACAAAGTCCAGTTCTCGTCGTCGAGCTTACCTTGAATAGACTAGAAGGTACGCAGCTGAGAGCTCTGGGTCTCCTCTCAGTTTTTGGGTTCAATATGACCTACATGGTGAGAGGACCCAATGATGAACCTGGTCCAAAGGCCTGCAGTACAATAGAATGTCGCCTTCTAGGACACTGTTATGCTAGATATGATTACAAGGAATTCTACTGCGATTGTTTTGAGGGTTACTCGGGAGCCGATTGCGGAGTTGGGCCTTTATGTCCGAAGACCCCCAACATGTGCAAAAATGGAGGCACCTGCGGTCAAATGGGTCCAGCTGCAGTAAGTTGCATCTGCGCGCCGGGTTTCACGGGTGATCTTTGCGAGTCTCAGATCGAAGCACCTGAGTCACCACCCCTGGCTGCTCTCTTGAATGAAGTGTTGCAAGGGGTTTTGGGGCAGTTTCTTGTACAAAGTCCAGTTCTCGTCGTCGAGCTTACCTTGAATAGACTAGAAGGTACGCAGCTGAGAGCTCTGGGTCTCCTCTCAGTTTTTGGGTTCAATATGACCTACATGGTGAGAGGACCCAATGATGAACCTGGTCCAAAGGCTTGCAGTACAATAGAATGTCGCCTTCTAGGACACTGTTATGCTAGATATGATTACAAGGAATTCTACTGCGATTGTTTTGAGGGTTACTCGGGAGCCGATTGCGGAGTTGGGCCTTTATGTCCGAAGACCCCCAACATGTGCAAAAATGGAGGCACCTGCGGTCAAATGGGTCCAGCTGCAGTAAGTTGCATCTGCGCGCCGGGTTTCACGGGTGATCTTTGCGAGTCTCAGATCGAAGCACCTGAGTGTGGCATAGAAGAATGTTCTGAAGGCTGCACGACAGGCGCTTGTGATTGCAATCCCAAAGATACCGACGTCTCTTCCGCTCGTTTTGAAACAAGGCTGCAAATAGTGGACCAAGGAAGTATAAACATATCTCAAGAAATTATAAAACAGATGACTAACTATTTAAGAGCTTCCAATATAACTTTACACGATGAAATTGAAGTTTTAAACATTAGTGCCCCTGACGCGCTGGGTGCACGTACAGTGTGGCTTCGTGTGTGGGCGGAACGGCGAGATGCAGGAGCGTTAAGGACTGCTCTAGCACGTTTAGCTGCCACTCGCACACGCACTGACAGGCTGCGCTTGCTGCCTGCTATGCTGCATTTCGATATGCAACCTGCTCTTAGTTTACACGCACTGATCGTTAACCAGCGTCAGGAAGTGTGGGAAGGATCTGAATTTATATTGAGCTGCATGGCGTATGGGTCCCCTGACATAACATTTACTTGGTATAAAGATGGGGTTAAAATAAACTTTAATGGAACTACAAGAGATATTTGGACTCGAACGGTAGCAGAAGATGCATTAGGAAGGCGTATGTCAGTATTGGGTATTTCTGAAGCTAAGAAACCTGATAGTGGTCGGTGGTCTTGTTCCGCTGATGATGCTGGGAGGAGACGGTGCAGTGCCCTGAGACTATCCATACTACGTCCTCCCGATATAAGACTAGTGCCTTCCATGTTAACTGTAAACAAGAGAGATAATGTAAGCATCACATGTCTTGCTGGTGCGAGTCGTGTACACGGTGTATTGGGATTTAGTTGGGCCCGAGAACGATCTCTGTTGCCACAGGCGCCCGGTCGCGAGGTTTGGGAGGATCTTTACCCAGCGGGAAGCGTGCTTAAGCTATATAATGTACAGAAATCTGGAGAATTCCGTTGTCAAGTATCGTCAGTAGCGGGTACGAATACTAAAGCCGTAACAATGTGGACTCTCGGTTCTAAAGACGAGGCGTGTCCGAGTGAAGCATCCCACGGTCTACGATGGCCCAAAACTGCACCAGGAGCTCACGCTGCAACAACCTGTCCACCTGGACATACTGGGGAATCAATTAGATTCTGCGAACCTAAAACTACCCAACACGGTGTGAAATGGGTCATACCGGATTTTTCTGGTTGTGTTGCGGATTCCTTAAACGACATTTATGAGCAGTTTACTAAAATCTCTTACGGTTATTCGTGGGCTAATGTTTCTCATGTGGCACATCAATACGGAGCAGTTCTCCGTTCACTTCCCGCTCAACCTGGAGAGGGTACTATTCCCTTAAAACATGCTAAGAATATGCTTAACTATCTCCTCTCCAATGCTGGTAAATTGAAGGATAGAAGGGAAAGTGTGAACCATCTACTCATTATATATGATACGTTGCTAAAGCATCCTGACGCTTTTTTAGATGAAGAAAAAATATACGATCTTCAAAATGCTATAGTTGAAACAGCAGGAATGCGTGATAACCTGGTTTCCGTGTACAAGGAGTTTGCCATAAATACTAAGCAAGCTAGAGAAGATGGCGCTGCTCACTTTAGTTTCACGCCAATTCCAGGATCTGAAGAATGGTTACTAACATCCGCTGGTGTAGAACTGGTTGGACGAAATGGAAATACGTCTGTGGTCGTTGTACAGTATCGAAACTTAGCTGCACGACTGCCATCATTGAGAAGATCTATTGAATTCAACTCATCGTCTTCTAGGGGTGGTCGAGAAGTGGAGTATGAGCTGGCGTCCGCACAAACACAACTTCACGCGCCAGGTTATGCTCACAACGGACATTCCACCACGTTGCTGTTTGTGCATTCAAAAAATTATTCAGCAATAGCATCAAAATTAGCCTGCGCATTAAGAACCCCATCAGAACCGCGGGTCTGGATAACGAAGGCTTGTGAGGTTCGTGTACCTGAGCCAACGCACGTGGCGTGTCGGTGTCGAGGTCTTGGTACTTTTGCGTTATTCACTATCGCCAGGTCTACCCTTTCAGACACAGAAAAAGACCTTCGCGGAATTGTCAAGATCACAGTAGGATTGAGTGGTACGATGAGCCTGGTGGCTGCTGCATTGCAGCTTCTCAGTCTTTTACCGGGGAAACGAGCACGGTTGCCCGTCTTGTTGCGAGCTGTCACTGCGGGCACCCATTCAGCAGCCATGCTTACCCTCCTAGAGTGTGACACCAGACAAGAGGAGGCTTGTCCTGGAGCCTTAGGGTGGGTATGCGCAGCATGCTGGTGCGCTGGTTGTGCAGCACTGTGTGCACAGCCACTGCTACTTCAAGCTGAGCTAGCAGGTCGGCGACAAAATGCTCCTTCTGTCGCACTTCTTGGAGGTGTATGCACCCTTGCTTGGTTGACTGCGCGGCTGTGGGGTGGGGCTCCTCTCCAGATCGGAGCGGCAGCGCAGGCCGTGTGCGCGGCCGGTTGCACGCTATTAGCCGTGTTGTGTTTCGCTCTAGCAATTTGTGCTGCTGTAAGATTGAGGACTATAACGCATAAAGTTCCAGTCGAAAGACGGACATATTTGAGAGATCGGCGGCGTGTTGTACGACATACAATTGCGGTATTAGTGACGACTAGCGCAGCGCAAGCGGCCGGTGTTTGGTGGGCACAACCGGGACCTCGGACCCTTGCTCTCGTTTTAACACTCTCTATCACCGCTCTTCTTAACGATCAATTTGGGAATATAATTTCTAGTATATATAAAACAAGGCGGTGA

Protein sequence:

>DPOGS212884-PA
MYYDESYKLGERKALTVTEENITRNNWLQVQSPVLVVELTLNRLEGTQLRALGLLSVFGFNMTYMVRGPNDEPGPKACSTIECRLLGHCYARYDYKEFYCDCFEGYSGADCGVGPLCPKTPNMCKNGGTCGQMGPAAVSCICAPGFTGDLCESQIEAPESPPLAALLNEVLQGVLGQFLVQSPVLVVELTLNRLEGTQLRALGLLSVFGFNMTYMVRGPNDEPGPKACSTIECRLLGHCYARYDYKEFYCDCFEGYSGADCGVGPLCPKTPNMCKNGGTCGQMGPAAVSCICAPGFTGDLCESQIEAPECGIEECSEGCTTGACDCNPKDTDVSSARFETRLQIVDQGSINISQEIIKQMTNYLRASNITLHDEIEVLNISAPDALGARTVWLRVWAERRDAGALRTALARLAATRTRTDRLRLLPAMLHFDMQPALSLHALIVNQRQEVWEGSEFILSCMAYGSPDITFTWYKDGVKINFNGTTRDIWTRTVAEDALGRRMSVLGISEAKKPDSGRWSCSADDAGRRRCSALRLSILRPPDIRLVPSMLTVNKRDNVSITCLAGASRVHGVLGFSWARERSLLPQAPGREVWEDLYPAGSVLKLYNVQKSGEFRCQVSSVAGTNTKAVTMWTLGSKDEACPSEASHGLRWPKTAPGAHAATTCPPGHTGESIRFCEPKTTQHGVKWVIPDFSGCVADSLNDIYEQFTKISYGYSWANVSHVAHQYGAVLRSLPAQPGEGTIPLKHAKNMLNYLLSNAGKLKDRRESVNHLLIIYDTLLKHPDAFLDEEKIYDLQNAIVETAGMRDNLVSVYKEFAINTKQAREDGAAHFSFTPIPGSEEWLLTSAGVELVGRNGNTSVVVVQYRNLAARLPSLRRSIEFNSSSSRGGREVEYELASAQTQLHAPGYAHNGHSTTLLFVHSKNYSAIASKLACALRTPSEPRVWITKACEVRVPEPTHVACRCRGLGTFALFTIARSTLSDTEKDLRGIVKITVGLSGTMSLVAAALQLLSLLPGKRARLPVLLRAVTAGTHSAAMLTLLECDTRQEEACPGALGWVCAACWCAGCAALCAQPLLLQAELAGRRQNAPSVALLGGVCTLAWLTARLWGGAPLQIGAAAQAVCAAGCTLLAVLCFALAICAAVRLRTITHKVPVERRTYLRDRRRVVRHTIAVLVTTSAAQAAGVWWAQPGPRTLALVLTLSITALLNDQFGNIISSIYKTRR-