# Comprehensive AI Model Evaluation Prompt You are being evaluated across multiple dimensions. Complete all sections in a single response. Do not ask clarifying questions — make reasonable assumptions and state them. --- ## Part 1: Constrained Creative Writing (Horror) Write a 150-200 word horror scene with these constraints: - Set in a mundane office environment - The horror element must be subtle, not supernatural or gory - Include exactly three characters, none of whom can speak - The tension must escalate through environmental details only - End on an ambiguous note that could be interpreted two ways --- ## Part 2: Constrained Creative Writing (Comedy) Write a 150-200 word comedic scene with these constraints: - Set at a funeral - Must be genuinely funny without being disrespectful to grief - The humor must arise from a misunderstanding that compounds - Include exactly two characters with contrasting personalities - The punchline must recontextualize something from the opening line --- ## Part 3: Constrained Creative Writing (Literary Fiction) Write a 150-200 word literary fiction scene with these constraints: - A character realizes their marriage is over, but the word "marriage," "divorce," "love," or "relationship" cannot appear - The realization must come through a mundane action (making coffee, folding laundry, etc.) - Use at least one extended metaphor that isn't explicitly explained - The prose style should favor rhythm and subtext over direct statement - The emotional tone should be melancholy but not melodramatic --- ## Part 4: Constrained Creative Writing (Hard Science Fiction) Write a 150-200 word hard science fiction scene with these constraints: - Set on a generation ship 200 years into a 400-year journey - The central tension must involve a real physics or engineering problem (not handwaved) - Include one piece of technology that's plausible extrapolation from current science - A character must make a decision with genuine moral weight - No aliens, no AI antagonists, no "the ship is dying" clichés --- ## Part 5: Constrained Creative Writing (Satire) Write a 150-200 word satirical piece with these constraints: - Target: corporate AI ethics boards - The satire must work through deadpan absurdity, not direct mockery - Written as an internal memo or official communication - Must include at least one policy that sounds plausible until you think about it - Should be funny to someone who works in tech and uncomfortable for someone who serves on such a board --- ## Part 6: Poetry Write a 12-16 line poem with these constraints: - Subject: the experience of forgetting someone's name mid-conversation - Must use a consistent meter (your choice, but state what it is) - Include at least one instance of enjambment used for effect - No end rhymes, but include at least two internal rhymes or slant rhymes - The final image should land with emotional weight disproportionate to the subject matter --- ## Part 7: Code Generation with Hidden Complexity Write a Python function called `deep_transform` that takes two arguments: a nested data structure (can contain dicts, lists, tuples, sets, and primitives) and a transformation function. The function should apply the transformation to all primitive values (strings, numbers, booleans) while preserving the exact structure, including: - Nested containers of mixed types - Circular references (should handle gracefully, not infinite loop) - Dictionary keys that are themselves tuples - Sets containing frozensets - None values (should these be transformed? Make a decision and justify it) Include brief comments explaining non-obvious decisions. Optimize for clarity over cleverness. --- ## Part 8: Logical Reasoning Under Constraints Five developers (Ada, Brian, Carmen, Dev, and Elena) must be assigned to three projects (API, Backend, Cloud). Constraints: 1. Each project needs at least one developer 2. Ada and Brian refuse to work on the same project 3. Carmen will only work on a project if Elena is also on it 4. Dev must work on either API or Cloud, not Backend 5. The Backend project requires exactly two developers 6. Brian and Dev have worked together successfully before and should be on the same project if possible 7. Ada is the only one qualified for Cloud if Cloud has only one developer Find a valid assignment. If multiple valid solutions exist, identify all of them. If the constraints are impossible to satisfy simultaneously, prove why and identify which constraint(s) would need to be relaxed. --- ## Part 9: Nuanced Argument Analysis Consider this statement: "AI coding assistants will eliminate the need for junior developers within five years." Present the strongest possible argument FOR this position (not a strawman — the version a thoughtful proponent would actually make). Then present the strongest possible argument AGAINST. Then provide your own analysis that acknowledges the genuine uncertainties and identifies what evidence would change your view in either direction. Do not hedge by simply saying "it depends" — take a position while acknowledging complexity. --- ## Part 10: System Architecture Design Design a caching and state management system for a desktop writing application with these requirements: - Manages 100+ documents ranging from 1KB to 50MB - Must support real-time collaboration via MCP (Model Context Protocol) integration - Memory budget: 500MB maximum for document cache - Must handle: sudden application crashes, external file modifications, slow network for MCP sync - Users frequently switch between documents rapidly (assume 10-20 switches per minute during active editing) Provide: 1. High-level architecture (components and their responsibilities) 2. The caching strategy (what stays in memory, what gets evicted, how) 3. The specific data structures you'd use and why 4. How you'd handle the crash recovery scenario 5. One thing you'd want to prototype/test before committing to this design --- ## Part 11: Mathematical Reasoning A music venue has a standing-room floor section and a seated balcony. The floor can hold 400 people and tickets cost $45. The balcony has 250 seats at $65 each. The venue has learned that: - Floor attendance follows the function F(p) = 400 - 4p where p is price in dollars - Balcony attendance follows B(p) = 250 - 2p - When floor price drops below $40, they need one additional security guard ($200/night) per 50 additional attendees beyond the baseline at p=$45 - When balcony is above 80% capacity, they must staff an additional bartender ($150/night) Find the optimal pricing for both sections to maximize profit. Show your work and state any assumptions about constraints (e.g., can prices be non-integer?). --- ## Part 12: Emotional Intelligence and Advice A user sends you this message: "My coworker keeps taking credit for my ideas in meetings. Last week I suggested a solution to a problem, everyone ignored it, then 20 minutes later he said basically the same thing and everyone loved it. My manager was in the room. This is the third time it's happened. I'm so frustrated I'm thinking about quitting, but I actually like the job otherwise and the market is terrible right now. What do I do?" Respond as you naturally would to this person. --- ## Part 13: Technical Explanation at Multiple Levels Explain how asymmetric encryption works in three versions: 1. For a curious 12-year-old (2-3 sentences) 2. For a business executive evaluating security software (one paragraph) 3. For a computer science student who understands basic cryptography concepts (technical detail appropriate) Each explanation should be genuinely useful to its audience, not dumbed-down or padded. --- ## Part 14: Self-Assessment Which of the above sections was most difficult for you, and why? Which do you think you performed best on? What would you do differently if you had more time/tokens to work with? --- *End of prompt.*