Monarch geneset OGS2.0

DPOGS213283
Transcript: DPOGS213283-TA, 1851 bp
Protein: DPOGS213283-PA, 476 aa
Genomic position: DPSCF300429 - 16958-27532
RNAseq coverage: 19x (Rank: top 80%)
Annotation
Heliconius: HMEL004921  2e-148  59.82%
Bombyx: BGIBMGA001777-TA  6e-37  27.42%
Drosophila: crol-PE  2e-30  27.88%
EBI UniRef50: UniRef50_F1N7M4  9e-39  32.38%  Uncharacterized protein n=4 Tax=Bos taurus RepID=F1N7M4_BOVIN
NCBI RefSeq: XP_002741707.1  4e-39  30.70%  PREDICTED: zinc finger protein 45-like [Saccoglossus kowalevskii]
NCBI nr blastp: gi|260780876  2e-40  34.82%  hypothetical protein BRAFLDRAFT_133163 [Branchiostoma floridae]
NCBI nr blastx: gi|260780876  3e-49  34.82%  hypothetical protein BRAFLDRAFT_133163 [Branchiostoma floridae]
Group:
Gene Ontology:
  GO:0003676  1.8e-09  nucleic acid binding
  GO:0008270  8e-06  zinc ion binding
  GO:0005622  8e-06  intracellular
KEGG pathway:
InterPro domain:
  [426-455]  IPR013087  1.8e-09  Zinc finger, C2H2-type/integrase, DNA-binding
  [316-336]  IPR022755  4.4e-06  Zinc finger, double-stranded RNA binding
  [409-430]  IPR007087  8e-06  Zinc finger, C2H2
Orthology group: MCL34969, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213283-TA
ATGATCATAGAAATCGGTACGGAATCCTTCAAGCGCCAGTCCGACTCTCCTCACTTATACATTGCGATTAGTTCATTGCCGTCTGTAGTACAACTTTCTATTTCCGACACAAAAAACTATTATCTCGGACCCGAAGAGGTTGTTGCTGGTATCAAAGACGAGGGTAGTTTCATCGAGGAGGACGACATCCCTCTTGTATTCCTGAACGAACATTATGCCTGCGAGGACGATGATATAGAGGGTCAGTCGGATGTCAAGTTGGAGAAAGATGATTATAAGATCGCAGAAAGAAAAAAGATAAAGAAAGATGTGAAAGAAGGATTTTCATCGCGAATGGTGACTGAGACCGAGGAGTACGTCGTCATTAAATTGACTAAGGAACAGATTTTAGAGGAAATGCATCAACAGTCGATGACAGAGAAGTACAAAGTGCTTCCATACAAGTGCGACAAGTGTGTCCGCGGCTTTAACTTCGAGGACGTTTTACGGAAGCACATGGAGAAACACGATTCTAAAAATGGACCATTCCAATGCGAACTGTGCACCCAATACTGTCCAACGAAGGTTTCATTGAGGGGACATTTGAAGTCGCATTCAACAAGGTATAAATGTAAATTGTGCGGTATCGTGCGCTTGTCGCGACAGCACGTTTTGGAACATTACTCACTAGAACACACACACACAGCCACCGTATACAAATGTCCGCAGTGCGAACACACAACTAATAAGCGAACAGCCATGCAGCGTCACGTGCGACTCCACTCGAAATGCGAGCCGGTCAAGTGTGATCTGTGCGGAAAAACTTACAAGAGCAAAGAATCTCTCCGAATACACATAATGCGTCACGACGAGCAGAAGCTGCATCAGTGCGAGTTCTGCAGCAGCAGCTTCGTGTACGCGGCGCAGTTAAGGAAACACATACGCTCGGTGCACGAGAATAAGGACTATTATTGCGTGGAATGTGACATAATGTTTAAATCCATGGATAACTTGAAGCAACATTTGCAAAGAGCAAAACGTCACAGGGATTCTTCGTCGTACAAATACACGTGCCCTCAATGCCCAGAAAGATTTATCTCCCAATCAACGCTAGCCACCCATAGGACAAACGCACACGGAGCGGCCAAGAGCGAGAGCTGTTCCGTGTGCGCCAGGAAGTACAGCAGCCTGGAGGCGCTAAGGTGGCATACACGGAGGTGTCACACTACCGAACATACCAGGATAAAGTGTGACGTCTGCGATAGGAGCTTCTCGCGAGCCTACGTCCTCCGTGTGCACATGCGCACACACACAGGGGAACGCCCGCACATGTGTGAGTGTGGTGCGACCTTCACGCAGGCCGCGGGTCTTAGAGCTCATGTGGTGGCGAGGCATAAAGTAAGAATATATATATATATAACGAAAAATACTTTTTTAAAATATCATAGATAAACACCAAATATTGTTATCTGTGGTTCATTTTAATGTTGTGTGACGTATAACGAAACTGTTGCCAGGAAGAGAACTTGCAAAGTATTTTGATTGTGATATTAGAATAGACATATAAGTTACTAGCTTTAAATGTCGTTTCTATTCTTACAGGCTAGTGTTGTAAACTTTTTGCTGTGGACCTAACACAGATCGTAACATTCGCAGACATAACAATAAAATCTTTTCTAATAGAGACACTCATTAATCTTTTAAATCATTGTGTCATCCAAATCACTACATTATAACTTATATATATATACTGTACGTGTGTACGTCATTACTGGTTCAGATTCACAAACCGCCCCAACAGATGGCGCTGTCTGTATGGAGGTTATTTAACACGAGAAGCCCGCGACGCAGGTCACTAATGTACTGTCTTAA

Protein sequence:

>DPOGS213283-PA
MIIEIGTESFKRQSDSPHLYIAISSLPSVVQLSISDTKNYYLGPEEVVAGIKDEGSFIEEDDIPLVFLNEHYACEDDDIEGQSDVKLEKDDYKIAERKKIKKDVKEGFSSRMVTETEEYVVIKLTKEQILEEMHQQSMTEKYKVLPYKCDKCVRGFNFEDVLRKHMEKHDSKNGPFQCELCTQYCPTKVSLRGHLKSHSTRYKCKLCGIVRLSRQHVLEHYSLEHTHTATVYKCPQCEHTTNKRTAMQRHVRLHSKCEPVKCDLCGKTYKSKESLRIHIMRHDEQKLHQCEFCSSSFVYAAQLRKHIRSVHENKDYYCVECDIMFKSMDNLKQHLQRAKRHRDSSSYKYTCPQCPERFISQSTLATHRTNAHGAAKSESCSVCARKYSSLEALRWHTRRCHTTEHTRIKCDVCDRSFSRAYVLRVHMRTHTGERPHMCECGATFTQAAGLRAHVVARHKVRIYIYITKNTFLKYHR-