Sign In

HD-Prot: A Protein Language Model for Joint Sequence-Structure Modeling with Continuous Structure Tokens

์ž‘์„ฑ์ž
  • Haebom
์นดํ…Œ๊ณ ๋ฆฌ
Empty

์ €์ž

Yi Zhou, Haohao Qu, Yunqing Liu, Shanru Lin, Le Song, Wenqi Fan

๐Ÿ’ก ๊ฐœ์š”

๋ณธ ์—ฐ๊ตฌ๋Š” ๋‹จ๋ฐฑ์งˆ ์„œ์—ด๊ณผ ๊ตฌ์กฐ์˜ ์ด์ค‘์„ฑ์„ ํ™œ์šฉํ•˜์—ฌ, ๊ธฐ์กด ์–ธ์–ด ๋ชจ๋ธ์ด ๊ตฌ์กฐ ์ •๋ณด๋ฅผ ๋ถˆ์—ฐ์†์ ์ธ ํ† ํฐ์œผ๋กœ ๋ณ€ํ™˜ํ•˜์—ฌ ๋ฐœ์ƒํ•˜๋Š” ์ •๋ณด ์†์‹ค ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ณ ์ž ํ•ฉ๋‹ˆ๋‹ค. ์ด๋ฅผ ์œ„ํ•ด ์—ฐ์†์ ์ธ ๊ตฌ์กฐ ์ •๋ณด๋ฅผ ๊ทธ๋Œ€๋กœ ํ™œ์šฉํ•˜๋Š” 'HD-Prot'๋ผ๋Š” ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ํ™•์‚ฐ ๋ชจ๋ธ์„ ์ œ์•ˆํ•˜๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ์„œ์—ด๊ณผ ๊ตฌ์กฐ๋ฅผ ๋™์‹œ์— ํ†ตํ•ฉ์ ์œผ๋กœ ํ•™์Šตํ•˜๊ณ  ์ƒ์„ฑํ•˜๋Š” ๋ฐ ์„ฑ๊ณตํ–ˆ์Šต๋‹ˆ๋‹ค.

๐Ÿ”‘ ์‹œ์‚ฌ์  ๋ฐ ํ•œ๊ณ„

โ€ข
๊ธฐ์กด ๋‹จ๋ฐฑ์งˆ ์–ธ์–ด ๋ชจ๋ธ์—์„œ ๊ตฌ์กฐ ์ •๋ณด๋ฅผ ์ด์‚ฐ์ ์ธ ํ† ํฐ์œผ๋กœ ๋ณ€ํ™˜ํ•  ๋•Œ ๋ฐœ์ƒํ•˜๋Š” ์ •๋ณด ์†์‹ค ๋ฌธ์ œ๋ฅผ ๊ทน๋ณตํ•  ์ˆ˜ ์žˆ๋Š” ๊ฐ€๋Šฅ์„ฑ์„ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์—ฐ์†์ ์ธ ๊ตฌ์กฐ ํ† ํฐ์„ ํ†ตํ•ฉํ•œ ์ƒˆ๋กœ์šด ๋‹จ๋ฐฑ์งˆ ์–ธ์–ด ๋ชจ๋ธ ์•„ํ‚คํ…์ฒ˜(HD-Prot)๋ฅผ ์ œ์•ˆํ•˜๋ฉฐ, ์ด๋Š” ์„œ์—ด-๊ตฌ์กฐ ๊ณต๋™ ์ƒ์„ฑ, ๊ตฌ์กฐ ์˜ˆ์ธก, ์—ญ์ ‘ํž˜ ๋“ฑ ๋‹ค์–‘ํ•œ ๋‹จ๋ฐฑ์งˆ ๊ด€๋ จ ํƒœ์Šคํฌ์—์„œ ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋ณด์ž…๋‹ˆ๋‹ค.
โ€ข
์ œํ•œ๋œ ์ปดํ“จํŒ… ์ž์›์œผ๋กœ๋„ ์ตœ์‹  ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ pLM๊ณผ ๋™๋“ฑํ•œ ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•˜๋ฉฐ, ์–ธ์–ด ๋ชจ๋ธ ๋‚ด์—์„œ ๋ฒ”์ฃผํ˜• ๋ฐ ์—ฐ์†ํ˜• ๋ถ„ํฌ๋ฅผ ๋™์‹œ์— ์ถ”์ •ํ•˜๋Š” ๊ฒƒ์ด ํšจ๊ณผ์ ์ž„์„ ์ž…์ฆํ•ฉ๋‹ˆ๋‹ค.
โ€ข
์ œ์•ˆ๋œ HD-Prot ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์„ ๋”์šฑ ํ–ฅ์ƒ์‹œํ‚ค๊ธฐ ์œ„ํ•œ ์ถ”๊ฐ€์ ์ธ ์—ฐ๊ตฌ ๋ฐ ๋‹ค์–‘ํ•œ ๋‹จ๋ฐฑ์งˆ ๊ด€๋ จ ๋ฌธ์ œ์— ๋Œ€ํ•œ ์ ์šฉ ๊ฐ€๋Šฅ์„ฑ์„ ํƒ์ƒ‰ํ•  ํ•„์š”๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.
๐Ÿ‘