合成数据:生成式人工智能数据训练中隐私保护和数据利用并行的新路径
[关键词]生成式人工智能数据训练隐私保护数据利用合成数据[中图分类号]G250.73 [文献标志码]A [DOI] 10.19764/j.cnki.tsgjs.20241442[本文引用格式].合成数据:生成式人工智能数据训练中隐私保护和数据利用并行的新路径[J].图书馆建设,2025(4):111-119,134.
Synthetic Data: A New Path of Parallel Implementation for Privacy Protection and Data Utilization in GAI Data Training
MaliyamuAisikaier
[Abstract]ataisgediiatidiigotailmotitloai andvigorosleloeaitelgeotetcee becomeathoryprobemtobeconsidered.InthedatatrainingofGAltherearerisksofilgalacquisinofcorpusdataanddataleaage. ThetraditionalpathfanozationddedentatioineingpaallimplementatiofoivcpoteconddatalziG datatraininghasprovedtobeinadequateandaparadigmsiftisneededtofindanewapproachBasedonthehidencharacteristicsof GAl (204号 datatrainingandtolthientpvacitobiiiofrsoaldataaralelimplementatiofpcpoted data utilization in GAl datatrainingcanbeachievedthroughprivacy-enhancingtechnologies.Byintegratingprivacy-enhancingtechnologiesand promotingtheevelomentandaplictioftheticdataanarachcanbroviddfoachevingetiveblancebtpracy protectionanddatautlationtheebpomotigeeatevelopentofGteltoriggreateenftsndpogsstety [Keywords]GAl; Data training; Privacy protection; Data utilization; Synthetic data
0引言
以ChatGPT等前沿应用为代表的生成式人工智能技术的崛起,在民众日常生活与思想意识层面掀起了一场史无前例的变革。(剩余15600字)