數據

The promise of synthetic data

In an algorithm-driven world where data is king, one mis-step can lead to a royal mess. Netflix discovered this in 2009 when it released anonymised movie reviews penned by subscribers. By crossmatching those snippets with reviews on another website, data sleuths revealed they could identify individual subscribers and what they had been watching. A gay customer sued for breach of privacy; Netflix settled.

That episode is still cited today by academics seeking ways of sifting useful information from data without outing the individuals who provide it. Where anonymisation failed, synthetic data might yet succeed.

It is, as its name suggests, artificially generated. It is most often created by funnelling real-world data through a noise-adding algorithm to construct a new data set. The resulting data set captures the statistical features of the original information without being a giveaway replica. Its usefulness hinges on a principle known as differential privacy: that anybody mining synthetic data could make the same statistical inferences as they would from the true data — without being able to identify individual contributions.

您已閱讀29%(1141字),剩餘71%(2763字)包含更多重要資訊,訂閱以繼續探索完整內容,並享受更多專屬服務。
版權聲明:本文版權歸FT中文網所有,未經允許任何單位或個人不得轉載,複製或以任何其他方式使用本文全部或部分,侵權必究。

中東期待沙烏地阿拉伯制衡川普

阿拉伯國家希望穆罕默德王儲和美國當選總統川普的密切關係能夠緩和川普政府的中東政策。

投資者押注防務支出增加,Palantir成爲「川普交易」贏家之一

彼得•蒂爾創立的數據公司的最大客戶是美國政府,自川普本月當選以來,其市值增加了230億美元。

Lex專欄:成長來之不易,雀巢前景平淡

要實現其成長目標,這家瑞士集團需要增加行銷投資。

Lex專欄:便宜商品是沃爾瑪股價上漲的基礎

沃爾瑪通過吸引高收入顧客和增加其他收入來源,出色地應對了經濟不景氣和通膨帶來的挑戰。

Lex專欄:奢侈品品牌寄希望於自己的美國夢

奢侈品在美國越來越具有吸引力,可能爲該行業提供新的成長跑道。

諾和諾德準備下一代減肥藥的試驗結果

這家丹麥公司預計,最新數據將顯示CagriSema可在一年多的時間內減輕25%的體重。
設置字型大小×
最小
較小
默認
較大
最大
分享×