E D R , A S I H C RSS

Bioinformatics


1. †Œ๊ฐœ


  • ฆ„ : Bioinformatics
  • ฐธ—ฌ :  •ฌก
  • ๊ธฐ๊ฐ„ : 2002. 3. 14 ~ 2002.8.xx
  • ”„กœ Šธ ‹œž‘™๊ธฐ™€   : ณธ —ฐ๊ตฌŠ” ฐจ„ธŒ€ Bio๊ธฐˆ —„œ “จ„ฐ  „๊ณตžกœ„œ  ‘๊ทผ•  ˆ˜ žˆŠ” ๊ธฐˆ ธ Bioinformatics— Œ€•œ ๊ธฐˆฅผ ‹ฆŠ” ๊ฒƒ„  œผกœ •œ‹ค.
  • ”„กœ Šธ ง„–‰ : ๊ตžฌ˜  •ฆฌ™€€ จ ›น‚ฌŠธ˜ งฌ, ๊€ จ ฌธ„œ  •ฆฌฅผ ฃผ•œผกœ •˜๊ฒ Šต‹ˆ‹ค. ‹จˆœ ฒˆ—ญ‹คŠ” ˜ฏธžˆŠ” žฌ •ฆฌ ๊ณผ •— …ธ ฅ„ ๊ธฐšธผ ˜ˆ •ž…‹ˆ‹ค.
  • ๊ตžฌ : โ€œBioinformatics: A practical guide to the analysis of genes and proteinsโ€, Second Edition edited by Baxevanis & Ouellette

2. ฑ… •ฆฌ


2.1. NCBI DataModel

™œ Model„ šฉ•˜Š”๊ฐ€? ‹ค œ— ๊ฐ€๊นŒš ธ€ ‹ค œกœ ผ–‚˜Š” ผ„ ‹ค ” ž˜ •‹œ‚ค๊ณ  ˜ˆธก๊ฐ€Šฅ•˜๊ฒŒ •œ‹ค.
Ÿฐ ทจ€—„œ NCBIŠ” sequence-related information— ๊€•œ ธ„ งŒ“ค—ˆ‹ค. ๊ทธฆฌ๊ณ  Ÿฐ ธ„ šฉ•„œ Entrez(data retrieval system)‚˜ GenBank DB(DNA seq.ฅผ  €žฅ•‘” DB, ‘ ๊ฐ€€Š” œ  „ž —ฐ๊ตฌ˜ ค‘š”•œ data“ค‹ค.)™€ ๊ฐ™ †Œ”„Šธ›จ–‚˜ †ต•ฉ DB‹œŠค…œ„ ๊ฐ€Šฅ•˜๊ฒŒ งŒ“ค—ˆ‹ค.

=== GenBank flatfile & format VS NCBI data model===
GenBank flatfile€ DNA-centered˜ ๊ณ „œ‹ค. DNAค‘‹ฌผŠ” ๊ฒƒ€ ––ค ‹จฐฑงˆ˜ œ  „ž  •ฅผ  €žฅ•˜๊ณ  žˆŠ” DNA˜—ญ DNAœ„˜ coding regionผ๊ณ  ถˆฆฐ‹ค. ฐ˜Œ€กœ Œ€€ถ„˜ Protein seq. DB“ค€ Protein-centered˜ ๊€ ฉฐ, Š” ‹จฐฑงˆ๊ณผ œ  „ž ‚ฌŠ” accesion number(œ  „žฅผ  ‘๊ทผ•˜๊ธฐœ„•œ DB˜ key๊ฐ’) ... ง„–‰ค‘

3. šฉ– ฐธกฐ

3.1. NCBI ž€

National Center for Biotechnology Information ถ„ž ƒฌผ  •ฅผ ‹คฃจŠ” ๊ตญ๊ฐ€ ธ žฃŒ›œผกœ„œ „คฆฝ˜—ˆœผฉฐ, NCBIŠ” ๊ณตšฉ DBฅผ งŒ“คฉฐ, ๊ณ„‚ฐ— ๊€•œ ƒฌผ•™— —ฐ๊ตฌฅผ Œ๊ณ  žˆœผฉฐ, Genome žฃŒฅผ ถ„„•˜๊ธฐ œ„•œ software „๊ตฌฅผ ๊ฐœฐœ•˜๊ณ , ƒฌผ•™  •ฅผ ๊ธ‰•˜๊ณ  žˆŠต‹ˆ‹ค. - ฆ‰, ธ๊ฐ„˜ ๊ฑ๊ฐ•๊ณผ งˆณ‘— ˜–ฅ„ ฏธน˜Š” ฏธ„ธ•œ ๊ณผ •“ค„ ‹ค ” ž˜ ••˜๊ธฐ œ„•œ “  ™œ™„ ˆ˜–‰

Established in 1988 as a national resource for molecular biology information, NCBI creates public databases, conducts research in computational biology, develops software tools for analyzing genome data, and disseminates biomedical information - all for the better understanding of molecular processes affecting human health and disease.

3.2. Entrez ž€

EntrezŠ” †ต•ฐ„ฐฒ Šค retrieval ‹œŠค…œœผกœ„œ DNA, Protein, genome mapping, population set, Protein structure, ฌธ—Œ ๊ฒ€ƒ‰ ๊ฐ€Šฅ•˜‹ค. Entrez—„œ Sequence, Šนžˆ Protein SequenceŠ” GenBank protein translation, PIR, PDB, RefSeqฅผ ฌ••œ ‹ค–‘•œ DB“ค— žˆŠ” „œ—„ ๊ฒ€ƒ‰•  ˆ˜ žˆ‹ค.
...ง„–‰ค‘

4. ƒฌผ•™๊ธฐˆ

4.1. ‰ ˆ˜ค‹ฐ“œ(nucleotide)ž€

DNA™€ RNAฅผ ๊ตฌ„ฑ•˜Š” nucleotideŠ” ธ‚ฐ๊ธฐ(Phophate), 5 ƒ„‹น(Sugar)ธ ””˜ฅ‹œกœŠค(deoxyribose), 4 ข…ฅ˜˜ งˆ†Œ —ผ๊ธฐ(Base) ค‘ •˜‚˜ฅผ ฌ••˜—ฌ 3๊ฐœ˜ €œ„(Phophate, Sugar, Base)กœ ๊ตฌ„ฑœ ฌผงˆ‹ค. ‹น€ ธ‚ฐ๊ณผ —ผ๊ธฐฅผ —ฐ๊ฒฐ‹œ‚จ‹ค. (šฉ–„ค…. ค‘•ฉ : งŽ€ ถ„ž๊ฐ€ ๊ฒฐ••˜—ฌ ฐ ถ„žŸ‰˜ ™”•ฌผกœ ˜Š” €™”)
ธ‚ฐ๊ธฐŠ” ATP—(๊ทผœก€  ATPฅผ †Œน„•„œ —„ˆ€ฅผ ‚ธ‹ค. ผข…˜ —„ˆ€›.) žˆŠ” ž˜ •Œ คง„ ‚ฐ„ฑ๊ธฐ‹ค. DNA ถ„žฅผ ๊ตฌ„ฑ•  •Œ—Š” ‹น— ง ‘ —ฐ๊ฒฐœ •˜‚˜˜ ธ‚ฐ๊ธฐงŒ ‚จŠ”‹ค. 5 ƒ„‹น ””˜ฅ‹œกœŠค(deoxyribose)Š” ATP˜ 5 ƒ„‹น ฆฌŠค(ribose)™€ งคšฐ œ ‚ฌ•˜‹ค. deoxyriboseŠ” ribose˜ 2ฒˆ ƒ„†Œ— žˆŠ” -OH ๊ธฐ Œ€‹  -H๊ธฐฅผ ๊ฐ€€๊ณ  žˆ‹ค. deoxyribose˜ 5๊ฐœ ƒ„†Œ—Š” 1ฒˆ—„œ 5ฒˆ๊นŒ€ ˆซž๊ฐ€ ™—ฌง„‹ค.

DNA— žฌ•˜Š” 4ข…ฅ˜˜ —ผ๊ธฐŠ” •„ฐ‹Œ(adenine), ๊ตฌ•„‹Œ(guanine), ‹ฐฏผ(thymine), ‹œ† ‹ (cytosine), šฐผ‹ค(uracil)‹ค. “ค ค‘—„œ ”ผฆฌฏธ”˜(pyrimidine)ผ๊ณ  €Š” thymine, cytosine, uracil€ งˆ†Œ™€ ƒ„†Œกœ ๊ตฌ„ฑœ 6๊ฐ˜•˜ ๊ณ ฆฌกœ ˜– žˆ‹ค. “จฆฐ(purine)ผ๊ณ  €Š” adenine, guanine€ ” ณตžก•˜—ฌ, งˆ†Œ™€ ƒ„†Œกœ ๊ตฌ„ฑœ 6๊ฐ˜•๊ณผ 5๊ฐ˜•˜ ค‘ ๊ณ ฆฌกœ ฃจ–ง„‹ค. nucleotide—„œ “ค —ผ๊ธฐ“ค€ deoxyribose˜ 1ฒˆ ƒ„†Œ— ๊ณตœ ๊ฒฐ•œผกœ —ฐ๊ฒฐ˜– žˆœผฉฐ, ธ‚ฐ๊ธฐŠ” 5ฒˆ ƒ„†Œ— —ญ‹œ ๊ณตœ ๊ฒฐ•œผกœ —ฐ๊ฒฐ˜– žˆ‹ค. adenine, guanine, cytosine, thymine, uracil€ ๊ฐ๊ฐ A, G, C, T,U กœ ‘œ๊ธฐœ‹ค.<๊ทธฆผ 1>

4.2. DNA VS RNA

•‚ฐ(Nucleic acid)ถ„žŠ” ฏฟ„ ˆ˜ —†„  •„กœ ๊ธ ค‘•ฉฐ, ๊ฐ ถ„žŠ” ๊ตฌกฐ ‹จœ„ธ nucleotideฅผ ˆ˜ฐฑงŒ ๊ฐœ”ฉ ฌ••˜๊ณ  žˆ‹ค.
Nucleic acidŠ” base˜ ข…ฅ˜™€ 5-carbon sugar˜ ข…ฅ˜, ถ„ž ๊ตฌกฐ— ”ฐผ DNA™€ RNAกœ ถ„ฅ˜œ‹ค.
•‚ฐ—ผ๊ธฐ˜ ข…ฅ˜5„‹น˜ ข…ฅ˜ถ„ž ๊ตฌกฐ
DNAA, G, C, TDioxyribose2ค‘ ‚˜„ 
RNAA, G, C, URibose‹จผ ‚ฌŠฌ

4.3. DNA

 ๊ทธฆผ€ DNA˜ ‹„‹ค.
DNAŠ” a twisted ladderผ๊ณ  ‘œ˜„˜Š”ฐ ‚ฌ‹คฆฌ˜ ๊ฐ๊ฐ˜ strandŠ” ‹น๊ณผ ธ‚ฐ˜ ๊ฒฐ•„ ˜ฏธ•˜๊ณ , lung€ Base“ค˜ ๊ฒฐ•„ ˜ฏธ•œ‹ค. Base“ค€ ‚ฌ˜ ๊ฒฐ•€ ˆ˜†Œ๊ฒฐ•„ ฃจŠ”ฐ, A™€ T, C™€ G๊ฐ€ ๊ฒฐ• ฃจ–ง„‹ค. ”ฐผ„œ DNAฅผ ถ„„• base“ค˜ ˆ˜ฅผ น„๊ต• A™€ T˜ ˆ˜๊ฐ€ ๊ฐ™๊ณ , C™€ G˜ ˆ˜๊ฐ€ ๊ฐ™Œ„ •Œ ˆ˜ žˆ‹ค. — •œฝ ๊ฐ€‹ฅ— žˆŠ” nucleotideŠ” ‹คฅธฝ ๊ฐ€‹ฅ˜ nucleotide „œ—„ ๊ฒฐ ••˜๊ฒŒ œ‹ค. ๊ทธž˜„œ ๊ทธ ‘ ๊ฐ€‹ฅ„ ƒ  (complementary) ผ๊ณ  •œ‹ค. ฆ‰, DNA ถ„žฅผ ˆ˜งœผกœ ๊ทธฆฌ •œ ๊ฐ€‹ฅ€ 5'—„œ 3'œผกœ œ„—„œ •„ž˜กœ ‹ฌฆฌ๊ณ , ‹คฅธ ๊ฐ€‹ฅ€ 5'—„œ 3'œผกœ •„ž˜กœ œ„กœ ‹ฌฆฐ‹ค.(5', 3' šจ†Œผ๊ณ  •Œ๊ณ  žˆŒ,  •™•žŒ ฆ„)

4.4. DNA Republication

™“Šจ๊ณผ ฌฆญ€ DNA˜ ๊ตฌกฐ, Šนžˆ Œ„ ฃฌ nucleotide˜ ƒ„ฑ œ  „ฌผงˆ˜  •™••œ ณต œ๊ธฐž‘˜ •‹ฌž„„ •Œ•˜‹ค. ๊ทธ“ค€ "šฐฆฌ๊ฐ€ ๊ฐ€ ••œ —ผ๊ธฐŒ ˜•„ฑ›ฆฌ๊ฐ€ œ  „ ฌผงˆ˜ ณต๊ธฐž‘„  œ‹œ•˜๊ณ  žˆŒ„ А‚„ ˆ˜ —ˆ‹ค."ผ๊ณ  ง•˜˜€‹ค. ๊ทธ“ค€ ค‘ ‚˜„ ˜ ‘ ๊ฐ€‹ฅ ถ„ฆฌ˜๊ณ  ๊ทธ ๊ฐ๊ฐ˜ ๊ฐ€‹ฅ„ ฃผ˜• (template)œผกœ •˜—ฌ ƒˆกœš ƒ  ‚ฌŠฌ ˜•„ฑœ‹คŠ” ‹จˆœ•œ ณต œธ„ งŒ“ค—ˆ‹ค.

4.5. DNA˜ —ผƒ‰‚—„œ˜ Žธ„ฑ(Organization)

ธ๊ฐ„˜ —ผƒ‰(chromosome)˜ ข…ฅ˜Š” 23๊ฐœ‹ค. 22๊ฐœŠ” ƒ—ผƒ‰(autosome)๊ณ  1๊ฐœŠ” „ฑ—ผƒ‰(sex chromosome)‹ค. •œ ข…ฅ˜˜ —ผƒ‰Š” „œกœ˜ Œ„ ๊ฐ€€๊ณ  žˆ‹ค. ”ฐผ„œ ธ๊ฐ„˜ —ผƒ‰๊ตฐ(genome)€ 46๊ฐœ˜ chromosomeœผกœ ๊ตฌ„ฑ˜– žˆ‹ค. chromosome€ „ธฌ‚—„œ Œ€€ถ„˜ ‹œ๊ฐ„„ ‹คƒ€ž˜(fiber)๊ฐ™€ ˜•ƒœกœ žˆŠ”ฐ.. Š” chromosome ๊ธฐณธ‹จœ„ธ ‰ ˆ˜ค†œ(Nucleosome)“ค ๊ฒฐ•œ ˜•ƒœ‹ค.  nucleosome€ •˜‚˜˜ žˆŠค†ค(histone)‹จฐฑงˆ„ DNA๊ฐ€ ‘ฒˆ œ˜๊ฐ€ ˜•ƒœ‹ค. --ž‘„ฑค‘

4.6. Genež€

œ  „ ˜•งˆ„ ง•˜ฉฐ œ  „— ๊€—ฌ•˜Š” Šน • ฌผงˆ‹ค. Gene˜ ž„ Genome‹ค. ˜•œ  GeneŠ” DNA— ๊ทธ ‚šฉ •”˜ธ™” ˜– žˆ‹ค. ฏธ •Œ๊ณ  žˆ„€„ ๊ฒ €งŒ, GeneผŠ” ๊ฒƒ€ DNA˜ —ผ๊ธฐ ฐฐ—‹ค.  —ผ๊ธฐ ฐฐ—(base sequence) ––ค ๊ณผ •„ †ต•„œ Œ€‘˜Š” ˆœ„œกœ •„ฏธ…ธ‚ฐ(amino acid)ผฆฌ˜ peptide๊ฒฐ•„ •˜—ฌ ‹จฐฑงˆกœ ‚˜ƒ€Š” ๊ฒƒ„ œ  „ ˜•งˆ ฐœ˜„ผ๊ณ  •œ‹ค.
šฐ„  ƒฌผ•™˜ •‹ฌ ก  Central Dogma(ค‘‹ฌก )— Œ€• •Œ•„๊ฒ ‹ค.
 ก € DNA๊ฐ€ ––๊ฒŒ ‹จฐฑงˆ„ ƒ„ฑ•˜Š” ๊ฐ€ฅผ —ฌฃผ๊ณ  žˆ‹ค.
๊ทธฆผ. 1
๊ทธฆผ 1„ ฐธกฐ•˜ DNAŠ” 2ค‘ ‚˜„ ˜• ๊ตฌกฐกœ ˜–žˆ‹ค. ๊ฒƒ „ธฌ ถ„— ๊ณผ •—„œ DNA— œ  „•”˜ธฅผ ณต‚ฌ•œ mRNAกœ ฐ”€Œฉฐ  mRNA๊ฐ€ Ribosome— “ค–๊ฐ€ tRNAŠ” mRNA— ‹๊ฒจžˆŠ” DNAœ  „•”˜ธฅผ ถ„„•˜—ฌ„œ Œ€‘˜Š” amino acidฅผ ๊ฐ€ ธ˜จ‹ค. Ÿฐ ๊ณผ • ฐ˜ณต˜๊ณ , amino acid‚ฌ—Š” peptide๊ฒฐ•„ ฃจ„œ Š” ‹จฐฑงˆกœ ˜•งˆ ฐœ˜„ œ‹ค. -- ง„–‰ค‘..

Bioinformaticsฅผ ๊ณต€•˜ คŠ” ‚ฌžŒ“ค„ œ„•

 ˆŒ€ “จ„ฐ €‹งŒœผกœ Šน€๊ฑธ ค๊ณ  •˜€ ง•„••  ๊ฒƒ ž…‹ˆ‹ค. “จ„ฐ €‹งŒœผกœŠ”  •ง ๊ธฐˆ ž ˆ˜€ ฐ–— ˜€ •‹ˆ‹ค. ๊ทธ€‹ •„š”•˜‹ค๊ณ  •„ ๊ฑ ๊ธฐˆ   €‹ผ๊ธฐ‹คŠ” ๊ณผ•™, ฆ‰,  „‚ฐ•™(Computer Science)˜ €‹ •„𔕋ˆ‹ค. ๊ทธฆฌ๊ณ  Bioinformaticsฅผ  œŒ€กœ ๊ณต€•˜ ค “จ„ฐ ถ„•ฅผ นผ๊ณ „ ตœ†Œ•œ ƒฌผ•™ ๊ฐœก , ถ„ž ƒฌผ•™, ƒ™”•™, œ  „•™, †ต๊ณ„•™ ๊ฐœก , ™•ฅ ก , ‹ค€Ÿ‰ †ต๊ณ„•™, ฏธ ถ„„ •Œ•„••‹ˆ‹ค. Ÿฐ ๊ฒƒ„ ๊ณ  ›ฐ–“ค๊ฒŒ ˜ ๊ฐ€žฅžฆฌงŒ Œ๊ฒŒ ฉ‹ˆ‹ค. ๊ตญ‚—„œ Bioinformaticsฅผ •˜ คŠ” Œ€€ถ„˜  „‚ฐ•™๊ณผ ๊ตˆ˜‹˜“ค  €ฅ˜— †•œ‹คŠ”   „œ๊ธ€”ˆ ‚ฌ‹คฃ .

 œŒ€กœ œ •ˆ‚ฅผ ฐ›œผ ค, ›„ธ—ฐ •‚ฌ‹˜˜ ‚ฌŠธฅผ ถ”ฒœ•‹ˆ‹ค. http://www.bioinformatics.pe.kr/ -- ๊น€ฐฝ€

DeleteMe QnAฅผ ฝ– •˜Š”ฐ ž‹ ๊ฐ 'š' –จ–€Š”๊ตฐš”.(๊ฒƒงŒ •„‹ˆ–‘ ˜คŠ˜ ๊ตžฌฅผ Š”ฐ ฒ˜ŒŠ” ƒ†Œ•œ ‹จ–“ค •Œฌธ— ‚ฌ „ ฐพœผž, ‚ฌŠธ Œ•„‹ค‹ˆฉฐ ––ค ๊ฑ€ •Œ•„ž, •งธŠ”ฐ..) ๊ทธž˜‘, ฆ„„ „ธฒˆ‚˜ ‹€ฆฐ  •™ฌธ ญ”€Š” •Œ๊ณ  ‹ถ๊ณ ,:) ‹คŒ— ˜น‹œ  €™€ น„Šท•œ €‹งŒ ๊ฐ€ง„ ‚ฌžŒ — ๊€‹ฌ„ ๊ฐ€€๊ณ  —ฐ๊ตฌฅผ •  •Œ „›€   ˆ˜ žˆ—ˆœผ •‹ˆ‹ค.

DeleteMe –„œ ”„กœ Šธผ •  •„˜ „ฑ๊ณผฅผ งŒ“ค–•ผ๊ฒ ๊ตฐš”.

Valid XHTML 1.0! Valid CSS! powered by MoniWiki
last modified 2021-02-07 05:22:36
Processing time 0.0314 sec