| Age | Commit message (Collapse) | Author |
|
- Add vanilla/contextual/all_memory from fullscale (different datasets)
- Flag dataset mismatch (5-dataset vs 3-dataset)
- Queue matching-dataset vanilla/contextual experiment
- Same-dataset comparison: rag_vector leads in success (54.2%)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
|
|
- Add rag_vector_no_short (eta_short=0) and rag_vector_no_long (eta_long=0) configs
- Add ablation run script for 60p60s experiment
- Document ablation design and expected results in notes
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
|
|
Cleaner than quintile bins - no non-monotonic issue
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
|
|
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
|
|
- (c) replaced boxplot with mean+SE bars + trend line, much clearer
- Q5 vs Q1 p=0.003**, clear ascending trend across quintiles
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
|
|
- (a) RAG+Vector vs Reflection: only rag_vector, clear improvement bars
- (b) Vector growth over 60 sessions
- (c) Preference similarity quartile boxplot (Q4 vs Q1 p=0.018*)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
|
|
- fig_method_comparison: normalized improvement vs reflection + learning curve
- fig_vector_analysis: vector growth + cumulative head-to-head advantage
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
|
|
- learning_and_vectors.png: learning curve, vector growth, cumulative advantage, efficiency
- method_comparison_bars.png: success/effort/timeout bar charts
- vector_similarity_60s.png: PCA, pref-vector correlation (r=0.046, p=0.054), heatmap
- vector_similarity_30s.png: same for 30 sessions
- vector_analysis.png: norm distribution + session range bars
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
|
|
- RAG rewrite adapter and vector preference pipeline in personalized_llm
- 60-session experiment queue scripts (reflection, rag, rag_vector, rag_rewrite)
- Vector-preference correlation analysis and visualization scripts
- Local reward model batch processing improvements
- Updated CLAUDE.md with full experiment documentation and notes
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
|
|
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
|
|
|
|
- Add collaborativeagents subproject with adapters, agents, and evaluation modules
- Update .gitignore to exclude large binary files (.whl, .tar), wandb logs, and results
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
|