| HN Points | HN Title (Links to original post) | Submitted Date |
|---|---|---|
| 14 | Show HN: Auto-generate hard evaluation data for LLMs | 2024-10-02 |
| 3 | Show HN: Talc (S23) Question and Answer Generation for AI Assistants | 2024-03-18 |
| 1 | Show HN: Talc – Custom benchmarking for LLM apps | 2023-11-01 |
| 3 | LLMs are still bad at handling dates | 2023-11-10 |
| 2 | OpenAI gets a C+ in high school English | 2023-11-17 |
| 2 | How do Google's code tips fail? | 2023-10-23 |