<aside> đź’ˇ The specification for an AI conceptual engineer

https://github.com/jerhadf/AI-CE

</aside>

Overview

Purpose

The AI Conceptual Engineer aims to assist and automate the process of conceptual engineering. This system would be a “copilot for the mind,” to automate and assist with the CE process. Using the vast linguistic training data of LLMs, along with more human input, it could build a deep and broad understanding of the concept. It could take a concept like <honesty> as input, and then decompose a concept, figure out its current function in our conceptual system, find its defects and benefits, and surface tensions or inconsistencies. Then it could help you improve the concept, explore alternatives, and generate new concepts. It could also help build a measurement system for the concept, gather data, run simulations and experiments, and elicit human feedback to empirically test and refine conceptual proposals. The goal is to augment and scale human conceptual engineering, in order to accelerate progress on conceptual issues related to AI alignment, fairness, transparency, and more.

Scope

The scope of this document includes the product outline, features, MVP stages, technical implementation, and a draft roadmap.

Features

Input and Initial Processing

Output: Applying Operations in CE