Diffstat (limited to 'notes.md')
-rw-r--r-- notes.md 43
1 files changed, 26 insertions, 17 deletions
diff --git a/notes.md b/notes.md
index 218e305..4cf5fd4 100644
--- a/notes.md
+++ b/notes.md
@@ -53,25 +53,34 @@
## Vector-Preference Correlation Analysis
-### 30-session experiment (rag_vector_v3)
+### 60-session experiment (rag_vector_60s, z_long)
**Vector statistics**:
-- 59/60 users have a non-zero vector
-- Norm: mean=0.0023, max=0.0051
-- Vector norm grows with sessions: 0.00086 (s10) → 0.00227 (s30)
-
-**Jaccard(revealed_prefs) vs vector similarity**:
-- Spearman r = 0.09 (p=0.015), **significant**
-- Trend in the expected direction: more shared preferences → higher vector similarity
-
-| Jaccard range | User pairs | Mean vector similarity |
-|-------------|---------|------------------|
-| 0.00-0.05 | 413 | -0.007 |
-| 0.05-0.10 | 215 | 0.034 |
-| 0.10-0.15 | 43 | 0.037 |
-| 0.15-0.25 | 10 | 0.072 |
-
-**Compared with the previous paper**: r=0.90 (5 users, 20 sessions) vs current r=0.09 (60 users, 30 sessions)
+- 60/60 users have a non-zero vector
+- Norm: mean=0.0033, max=0.0067
+- Vector norm grows with sessions: 0.0 (s0) → 0.0033 (s60)
+
+**Jaccard(revealed_prefs) vs vector cosine similarity**:
+- Spearman ρ = 0.040 (p=0.093, marginal)
+- z_long works best, z_short shows no signal, combined is diluted
+
+**Bottom 25% vs Top 25% preference similarity pairs** (for the report):
+- Bottom 25% (least similar preferences): mean cosine = **-0.027**
+- Top 25% (most similar preferences): mean cosine = **+0.012**
+- Mann-Whitney U: **p = 0.021 \***
+- Conclusion: user pairs with more similar preferences did learn more similar vectors
+
+**Comparison across vectors**:
+
+| Vector | Spearman ρ | p | Q5 vs Q1 p |
+|--------|-----------|---|-----------|
+| z_long | 0.040 | 0.093 | **0.006** |
+| z_short | 0.010 | 0.681 | 0.586 |
+| combined | 0.046 | 0.054 | 0.003 |
+
+→ z_long captures long-term preference trends best; z_short shows essentially no signal
+
+**Visualization**: `collaborativeagents/results/fig_main_results.png` panel (c)
---
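
The notes compare, for every user pair, the Jaccard overlap of revealed preferences against the cosine similarity of the learned vectors, then report a Spearman correlation and a bottom-vs-top quartile contrast. The analysis script itself is not part of this diff, so the following is only a minimal stdlib sketch of that kind of computation. The function names, the `prefs`/`vectors` dictionaries, and the toy data are all hypothetical and are not taken from the repository.

```python
import math
from itertools import combinations


def jaccard(a, b):
    """Jaccard overlap of two preference sets (0.0 if both are empty)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(u, v))
    nu = math.sqrt(sum(x * x for x in u))
    nv = math.sqrt(sum(y * y for y in v))
    return dot / (nu * nv) if nu and nv else 0.0


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def spearman(x, y):
    """Spearman rho: Pearson correlation computed on tie-averaged ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def pairwise_similarities(prefs, vectors):
    """Jaccard and cosine lists over all unordered user pairs."""
    jac, cos = [], []
    for u, v in combinations(sorted(prefs), 2):
        jac.append(jaccard(prefs[u], prefs[v]))
        cos.append(cosine(vectors[u], vectors[v]))
    return jac, cos


def quartile_means(jac, cos):
    """Mean cosine of the bottom and top 25% of pairs ranked by Jaccard."""
    order = sorted(range(len(jac)), key=lambda i: jac[i])
    k = max(1, len(order) // 4)
    bottom = [cos[i] for i in order[:k]]
    top = [cos[i] for i in order[-k:]]
    return sum(bottom) / len(bottom), sum(top) / len(top)


# Hypothetical toy data, for illustration only.
prefs = {"u1": {"a", "b"}, "u2": {"a", "b"}, "u3": {"c"}, "u4": {"c", "d"}}
vectors = {"u1": [1.0, 0.0], "u2": [0.9, 0.1], "u3": [0.0, 1.0], "u4": [0.1, 0.9]}

jac, cos = pairwise_similarities(prefs, vectors)
rho = spearman(jac, cos)
bottom_mean, top_mean = quartile_means(jac, cos)
```

With real data, the two quartile groups could be passed to `scipy.stats.mannwhitneyu` to obtain a p-value like the one reported above. With many tied Jaccard values, which pairs fall into each quartile depends on how ties are ordered, so the cut-off rule is worth stating alongside the result.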