PolyU research finds that improving AI large language models helps them better align with human brain activity

HONG KONG, May 27, 2024 /PRNewswire/ -- With generative artificial intelligence (GenAI) transforming the social interaction landscape in recent years, large language models (LLMs), which use deep-learning algorithms to train GenAI platforms to process language, have been put in the spotlight. A recent study by The Hong Kong Polytechnic University (PolyU) found that LLMs perform more like the human brain when they are trained in ways that more closely resemble how humans process language, a finding that offers important insights for brain studies and the development of AI models.

Current LLMs mostly rely on a single type of pretraining: contextual word prediction. This simple learning strategy has achieved surprising success when combined with massive training data and model parameters, as shown by popular LLMs such as ChatGPT. Recent studies also suggest that word prediction in LLMs can serve as a plausible model of how humans process language. However, humans do not simply predict the next word; they also integrate high-level information when comprehending natural language.

A research team led by Prof. Li Ping, Dean of the Faculty of Humanities and Sin Wai Kin Foundation Professor in Humanities and Technology at PolyU, incorporated the next sentence prediction (NSP) task into model pretraining and examined the correlation between the model's data and brain activation. NSP simulates one central process of discourse-level comprehension in the human brain: evaluating whether a pair of sentences is coherent. The study was recently published in the academic journal Science Advances.
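
For readers unfamiliar with NSP, the task can be illustrated with a minimal Python sketch of how training pairs are commonly built. This follows the widely used construction in which half of the pairs keep the true next sentence and half substitute a sentence from another document. It is an assumption for illustration only: the release does not describe how the PolyU team prepared its data, and the function name `make_nsp_pairs` is hypothetical.

```python
import random


def make_nsp_pairs(documents, rng):
    """Build (sentence_a, sentence_b, is_next) examples for NSP.

    documents: list of documents, each a non-empty list of sentence strings.
    rng: a random.Random instance, passed in so results are reproducible.

    Assumed scheme (not taken from the study): for every adjacent sentence
    pair, keep the true next sentence about half the time (label 1) and
    otherwise swap in a random sentence from a different document (label 0).
    """
    if len(documents) < 2:
        raise ValueError("need at least two documents to sample incoherent pairs")

    pairs = []
    for doc_idx, doc in enumerate(documents):
        for i in range(len(doc) - 1):
            if rng.random() < 0.5:
                # Coherent pair: the second sentence really follows the first.
                pairs.append((doc[i], doc[i + 1], 1))
            else:
                # Incoherent pair: second sentence comes from another document.
                other_indices = [j for j in range(len(documents)) if j != doc_idx]
                other_doc = documents[rng.choice(other_indices)]
                pairs.append((doc[i], rng.choice(other_doc), 0))
    return pairs
```

A model trained on such pairs learns to judge whether the second sentence coherently follows the first, in addition to whatever word-prediction objective it is given.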

The research team trained two models, one with NSP enhancement and the other without; both also learned word prediction. Functional magnetic resonance imaging (fMRI) data were collected from people reading connected or disconnected sentences. The team then examined how closely the patterns from each model matched the brain patterns in the fMRI data.
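
One common way to quantify how closely model patterns match brain patterns is representational similarity analysis. The Python sketch below is offered under that assumption only: the release does not state which method the team used, and the name `rsa_score` and the array shapes are hypothetical. It compares how items relate to one another in a model with how the same items relate to one another in brain responses.

```python
import numpy as np


def rsa_score(model_patterns, brain_patterns):
    """Correlate a model's item-by-item dissimilarities with the brain's.

    model_patterns: array of shape (n_items, n_model_features)
    brain_patterns: array of shape (n_items, n_voxels)
    Both arrays must list the same items (e.g. sentence pairs) in the same order.
    Returns a Pearson correlation; higher means a closer match.
    """
    model_patterns = np.asarray(model_patterns, dtype=float)
    brain_patterns = np.asarray(brain_patterns, dtype=float)
    if model_patterns.shape[0] != brain_patterns.shape[0]:
        raise ValueError("model and brain patterns must cover the same items")

    # Dissimilarity between every pair of items: 1 - Pearson correlation.
    model_rdm = 1.0 - np.corrcoef(model_patterns)
    brain_rdm = 1.0 - np.corrcoef(brain_patterns)

    # Use only the upper triangle; the matrices are symmetric with a zero diagonal.
    upper = np.triu_indices(model_patterns.shape[0], k=1)
    return float(np.corrcoef(model_rdm[upper], brain_rdm[upper])[0, 1])
```

Under this sketch, computing the score once for the model trained with NSP and once for the model trained without it, against the same fMRI data, would give a direct comparison of the two.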

Training with NSP provided clear benefits. The model with NSP matched human brain activity in multiple areas much better than the model trained only on word prediction. Its mechanism also maps well onto established neural models of human discourse comprehension. The results gave new insights into how our brains process full discourse such as conversations. For example, parts of the right side of the brain, not just the left, helped in understanding longer discourse. The model trained with NSP could also better predict how fast someone read, showing that simulating discourse comprehension through NSP helped AI understand humans better.

Recent LLMs, including ChatGPT, have relied on vastly increasing the training data and model size to achieve better performance. Prof. Li Ping said, "There are limitations in just relying on such scaling. Advances should also be aimed at making the models more efficient, relying on less rather than more data. Our findings suggest that diverse learning tasks such as NSP can improve LLMs to be more human-like and potentially closer to human intelligence."

He added, "More importantly, the findings show how neurocognitive researchers can leverage LLMs to study higher-level language mechanisms of our brain. They also promote interaction and collaboration between researchers in the fields of AI and neurocognition, which will lead to future studies on AI-informed brain studies as well as brain-inspired AI."

Media Contact
Ms Annie Wong
Senior Manager, Public Affairs
Tel: +852 3400 3853
Email: anniewy.wong@polyu.edu.hk

source: The Hong Kong Polytechnic University
