Improves the Performance Element based on feedback from the Critic
Two goals for improving the Performance Element
Make it find a better solution
Make it find a solution faster
Critic
Provides feedback, based on some outside standard, to the Learning Element
Problem Generator
Provides opportunities to learn
Suggests alternative actions that have unknown outcomes
Learning Functions
Hypothesis = approximation of the function to be learnt
Larger hypothesis space: more likely to contain the perfect function, but harder to find
Slower convergence
More examples needed
Decision Tree Induction
Algorithm (recursive; see the sketch after this list)
Does the whole group have the same output value? If so, place a leaf node with that value in the output tree
Select the most discriminating attribute
Break the examples into groups depending on the value of that attribute
Place a branch in the output tree
Recursively process the groups (removing the dividing attribute)
Determining the most discriminating attribute
Causes the most examples with the same output value to be in the same group
Causes groups of even size
I don't think this is good
Minimises total entropy
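A minimal Python sketch of the recursive induction algorithm above, using the minimise-total-entropy criterion to pick the most discriminating attribute. The representation (examples as dicts with an "output" key) is an assumption for illustration, not from the notes.

```python
import math
from collections import Counter

def entropy(examples):
    """Entropy of the output values of a group of examples."""
    counts = Counter(ex["output"] for ex in examples)
    total = len(examples)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def most_discriminating(examples, attributes):
    """Pick the attribute whose split minimises total (weighted) entropy."""
    def split_entropy(attr):
        groups = {}
        for ex in examples:
            groups.setdefault(ex[attr], []).append(ex)
        return sum(len(g) / len(examples) * entropy(g) for g in groups.values())
    return min(attributes, key=split_entropy)

def induce_tree(examples, attributes):
    outputs = {ex["output"] for ex in examples}
    if len(outputs) == 1:                      # whole group has the same output value
        return outputs.pop()                   # -> leaf node
    if not attributes:                         # no attributes left: majority vote
        return Counter(ex["output"] for ex in examples).most_common(1)[0][0]
    attr = most_discriminating(examples, attributes)
    tree = {"attribute": attr, "branches": {}}
    groups = {}
    for ex in examples:                        # split on the chosen attribute
        groups.setdefault(ex[attr], []).append(ex)
    remaining = [a for a in attributes if a != attr]
    for value, group in groups.items():        # recurse, dropping the dividing attribute
        tree["branches"][value] = induce_tree(group, remaining)
    return tree
```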
Explicit vs Implicit Representation
Explicit
E.g. a lookup table
Implicit
E.g. a linear weighted sum of features
Advantages
Generalisation
Can return an estimate of the value for states not yet seen
Space
No need to store a large data structure
5. Sequential Decision Problems
Utility depends on the sequence of decisions
Utilities only known in terminal states
Uncertain Domains
Stochastic actions
Nondeterministic
Transition Model
Probability of being in a particular state after taking an action
M(i,a,j)=P(j|i,a)
Markov Property: the transition probability depends only on the current state (not on past history)
Markov Decision Problem: calculating the optimal policy in an environment that is:
Accessible
You can tell what state you are in
Stochastic
with a known transition model
Step Cost
normally <0
Smaller (absolute) value => more conservative policy
Larger (absolute) value => more direct (riskier) policy
Reward(i) = StepCost(i) + Utility(i) (the utility term applies only if i is terminal)
Solving For Utility Values
Bellman's Equation
U(i) = R(i) + Max(over all actions a) { Sum(over all states j) {M(i,a,j)*U(j)} }
Nonlinear (contains maxes)
Value Iteration (see the sketch below)
U'(i) = R(i) + Max(over all actions a) { Sum(over all states j) {M(i,a,j)*U(j)} }
Repeat until U' is close enough to U
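A minimal sketch of value iteration as described above, assuming the transition model is a dict M[(i, a, j)] = P(j | i, a) and R maps each state to its reward; terminal states can simply have no outgoing entries in M. The names and the epsilon threshold are illustrative assumptions.

```python
def value_iteration(states, actions, M, R, epsilon=1e-6):
    U = {s: 0.0 for s in states}
    while True:
        U_new = {}
        for i in states:
            # U'(i) = R(i) + max over actions a of sum over j of M(i,a,j) * U(j)
            U_new[i] = R[i] + max(
                sum(M.get((i, a, j), 0.0) * U[j] for j in states)
                for a in actions
            )
        delta = max(abs(U_new[s] - U[s]) for s in states)
        U = U_new
        if delta < epsilon:          # repeat until U' is close enough to U
            return U
```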
Policy Iteration (see the sketch below)
To get the utilities from a policy P
Solve U(i) = R(i) + Sum(over all states j) {M(i,P(i),j)*U(j)}
Or use a fixed number of steps of value iteration:
U'(i) = R(i) + Sum(over all states j) {M(i,P(i),j)*U(j)}. Repeat this a fixed number of times, without updating the policy, to determine the utilities
Called "Modified Policy Iteration"
For a large number of states, this is faster than the original policy iteration
Check for a better action for each state
If there is some action a, a != P(i), that results in a higher utility value for U(i) than taking P(i), update P(i) to take that action
If none is found, then the policy has converged and is optimal
Else recalculate the utilities for the new policy
Policy iteration is often faster than value iteration, because the policy can become optimal before the utility values have fully converged; the utilities do not need to be exact to end up with the same (optimal) policy
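A minimal sketch of (modified) policy iteration under the same M and R assumptions as the value-iteration sketch: k sweeps of value determination for the current policy, then a check for a better action in each state.

```python
def policy_iteration(states, actions, M, R, k=20):
    policy = {s: actions[0] for s in states}      # arbitrary initial policy
    U = {s: 0.0 for s in states}
    while True:
        # value determination: k sweeps of U'(i) = R(i) + sum_j M(i,P(i),j) * U(j)
        for _ in range(k):
            U = {i: R[i] + sum(M.get((i, policy[i], j), 0.0) * U[j] for j in states)
                 for i in states}
        changed = False
        for i in states:                          # check for a better action
            def q(a):
                return sum(M.get((i, a, j), 0.0) * U[j] for j in states)
            best = max(actions, key=q)
            if q(best) > q(policy[i]):
                policy[i] = best
                changed = True
        if not changed:                           # policy converged: optimal
            return policy, U
```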
Environment contains no terminal states (the agent lives forever)
Additive utilities become infinite, as paths become infinitely long
Value iteration never terminates
Discounting is an alternative to additive utility
The utility of a sequence of states: U(s0...sn) = Sum(over n) {R(sn)*y^n}, for some constant 0 <= y < 1
This has a finite value even for infinitely long chains of states
This factor y can be added to our Bellman-like equations (in direct solution, policy iteration, and value iteration): U(i) = R(i) + y*Max(over all actions a) { Sum(over all states j) {M(i,a,j)*U(j)} }
6. Reinforcement Learning
Issues:
If the reward comes only at the end of the game, it is hard to know which moves were the good ones unless a lot of games are played
Advantages
Can learn the transition model
Active or Passive
Active: must choose its own moves
Optimistic Prior
To start explorative and become exploitative
Initially assume everything is good
U+(i) <- R(i) + Max(over all actions a) { f( Sum(over all states j) {M(i,a,j)*U+(j)}, N(a,i) ) }
f(u,n) is the exploration function, where u is a utility estimate and n is the number of times this (state, action) pair has been tried before
Example:
f(u,n) = { 1000, if n < 10; u, otherwise }
This means the agent will try each (state, action) pair at least 10 times
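A minimal sketch of the exploration function in the example above; R_PLUS stands in for the optimistic "1000" value and N_E for the 10-try threshold (both names are assumptions).

```python
R_PLUS = 1000   # optimistic estimate: assume untried actions are very good
N_E = 10        # try each (state, action) pair at least this many times

def exploration_fn(u, n):
    """u: current utility estimate; n: times this (state, action) pair was tried."""
    return R_PLUS if n < N_E else u
```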
Can turn a passive method into an active one
By learning the model
Using the performance element to select a move based on the known utilities, taking advantage of an exploration function
Passive: moves are provided to the agent (learning element)
Utility Learning
Learns the utility function
LMS
Least Mean Square
Algorithm: at the end of some chain of states, move backwards through the chain; at each state ei:
Reward-to-Go += Reward(ei)
U(ei) = RunningAverage(U(ei), Reward-to-Go)
Ignores the model. State utilities are interdependent; this does not take advantage of that information
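A minimal sketch of the LMS update above, run once at the end of each training sequence. U, counts and reward are assumed to be dicts keyed by state.

```python
def lms_update(sequence, reward, U, counts):
    """Walk backwards through the visited states, averaging reward-to-go into U."""
    reward_to_go = 0.0
    for state in reversed(sequence):
        reward_to_go += reward[state]
        counts[state] = counts.get(state, 0) + 1
        # running average of the observed reward-to-go for this state
        U[state] = U.get(state, 0.0) + (reward_to_go - U.get(state, 0.0)) / counts[state]
    return U
```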
ADP
Adaptive Dynamic Programming
Solve U(i) = R(i) + Sum(over all j){M(i,j)*U(j)}. M(i,j) is given to us as the model; once enough sample sequences have been seen, we can solve this for U(i) (and R(i))
Intractable in large spaces
Useful theoretical benchmark
Similar to value determination in SDP
TD
Temporal Difference Learning
Algorithm
For terminal states
Utility = running average of the reward at that state
For non-terminal states:
If we encountered state j immediately after state i,
Then U(i) <- U(i) + y*(R(i) + U(j) - U(i))
Improved version: replace y with a function that decreases the more often state i has been seen
y is a constant: 0 <= y <= 1
Makes use of the relationships between states implicitly
Rather than explicitly (through the transition matrix M(i,j))
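A minimal sketch of the TD update above; alpha plays the role of the constant y (and could be replaced by a function that decays with the visit count of i, as the improved version suggests).

```python
def td_update(U, R, i, j, alpha=0.1):
    """After observing a transition i -> j, nudge U(i) toward R(i) + U(j)."""
    u_i = U.get(i, 0.0)
    U[i] = u_i + alpha * (R[i] + U.get(j, 0.0) - u_i)
```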
Active ADP
If the agent gets to choose its own actions (active)
As states are encountered:
Update the action model
Use value iteration to find the utilities (making use of the updated action model)
Get the action from the performance element
Choosing the best move using: argmax(over all actions b) Sum(over all states j) {M(i,b,j)*U(j)}
Making sure to keep track of the previous state and the action taken, so that the action model can be updated at the next step
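A minimal sketch of one step of an active ADP agent, reusing the value_iteration sketch from earlier. The agent dictionary layout (counts, outcomes, M, R, U, prev_state, prev_action) is an assumption, and R is assumed to be pre-initialised to 0.0 for every state.

```python
def active_adp_step(agent, current_state, reward):
    # update the action model from the previous (state, action, outcome) triple
    if agent["prev_state"] is not None:
        i, a = agent["prev_state"], agent["prev_action"]
        agent["counts"][(i, a)] = agent["counts"].get((i, a), 0) + 1
        key = (i, a, current_state)
        agent["outcomes"][key] = agent["outcomes"].get(key, 0) + 1
        for (ii, aa, jj), n in agent["outcomes"].items():
            agent["M"][(ii, aa, jj)] = n / agent["counts"][(ii, aa)]
    agent["R"][current_state] = reward
    # re-solve for utilities with the updated model
    agent["U"] = value_iteration(agent["states"], agent["actions"],
                                 agent["M"], agent["R"])
    # performance element: choose the action with the best expected utility
    def expected_u(a):
        return sum(agent["M"].get((current_state, a, j), 0.0) * agent["U"][j]
                   for j in agent["states"])
    action = max(agent["actions"], key=expected_u)
    # remember state and action so the model can be updated next step
    agent["prev_state"], agent["prev_action"] = current_state, action
    return action
```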
Model Learning
In passive learning (the agent does not choose actions)
M(i,j) = % of the time that being in state i leads to a transition to state j
Active learning
M(i,a,j) = % of the time that taking action a in state i leads to state j
A model is needed
to go from a utility table to a policy table
For ADP
Not for TD, as an update toward an unexpected successor state only happens in proportion to the chance of reaching that state
This applies whether active or passive
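A minimal sketch of model learning by counting transitions, for the active case M(i, a, j); the passive case is identical with the action dropped from the keys.

```python
from collections import defaultdict

class TransitionModel:
    def __init__(self):
        self.tried = defaultdict(int)   # times (i, a) has been tried
        self.seen = defaultdict(int)    # times (i, a) has led to j

    def record(self, i, a, j):
        self.tried[(i, a)] += 1
        self.seen[(i, a, j)] += 1

    def prob(self, i, a, j):
        """Estimated M(i, a, j): fraction of the times (i, a) was tried that led to j."""
        tried = self.tried[(i, a)]
        return self.seen[(i, a, j)] / tried if tried else 0.0
```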
Q-Learning
Learns an action-value function: the utility of taking an action in a state
Learns: Q(i,a)
Could convert to utilities by: U(i) = Max(over all actions a){Q(a,i)}
No model required
Does not consider which state an action leads to
Must know the actions taken at each state (good to implement as active)
TD
For j = current state, i = previous state, a = last action taken, y = learning rate: Q(a,i) <- Q(a,i) + y*(R(i) + Max(over all actions b){Q(b,j)} - Q(a,i))
Performance element:
argmax(over all actions b) f(Q(b,j), N(b,j))
Compared to utility learning
Slower to converge; greater policy loss
Consistency between values is not enforced by a model
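A minimal sketch of the Q-learning update and performance element above. As in the notes, alpha stands in for the learning rate y, and no discount factor is applied; Q and N are assumed to be dicts keyed by (action, state).

```python
def q_update(Q, R, i, a, j, actions, alpha=0.1):
    """i: previous state, a: action taken there, j: current state."""
    best_next = max(Q.get((b, j), 0.0) for b in actions)
    q_old = Q.get((a, i), 0.0)
    Q[(a, i)] = q_old + alpha * (R[i] + best_next - q_old)

def choose_action(Q, N, j, actions, f):
    """Performance element: argmax over actions b of f(Q(b, j), N(b, j))."""
    return max(actions, key=lambda b: f(Q.get((b, j), 0.0), N.get((b, j), 0)))
```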
7. Planning
Algorithm:
Represent the start conditions as postconditions of a Start action, and the goal conditions as preconditions of an End action. Order Start before End
Are there any open preconditions? If not, return the plan
Choose an open precondition C of some step S
Add a new action A, or reuse an existing one, that has C as a postcondition, ordering A before S
Detect and Deal with Clobbering
Clobbering
Defn: A new action C clobbers a causal link between existing actions A and B (where a postcondition of A meets a precondition of B) if, were the actions interleaved as A, C, B, C would cause that postcondition to no longer hold when B is reached
Resolution
Demotion: Order C before A
Promotion: Order C after B
Runtime Failure
Contingency Planning
Plan to obtain information, with observation actions
Expensive, because many contingency plans must be made for unlikely conditions
Hard to know all possible failures in the real world
If an open condition can be established by an observation action, add that observation action to the plan and make a subplan for that condition
Monitoring
Two types of monitoring
Execution Monitoring
Check whether any future preconditions that should have been met are not
Use the causal links from the partial-order planner
Action Monitoring
Check whether the preconditions of the next action are met
Check whether the action itself fails when executed
Replanning
Replan from Scratch
Plan to get back on track
Much more efficient for long plans
Situated Planning
Combines execution monitoring and planning into one process
Many Activities
Execute Plan Step
Monitor world
Fix problems in plan
Clobbering
Open Conditions
Refine plan based on new information
World State changed
Possibly due to actions of other agents
Action Failure
8. Logical Agents
Sentences
Satisfiable
A sentence is satisfiable if there exists at least one model in which it is true
A model m satisfies a sentence s if s is true in m
A model m satisfies a KB of sentences if all sentences in the KB are true in m
i.e. if m satisfies all sentences in the KB
Valid
A sentence is valid if it is true in all models
Examples
True
Av~A
A=>A
A^(A=>B)=>B
Not related to being syntactically valid
Unsatisfiable
A sentence is unsatisfiable if it is true in no models
Examples
False
A^~A
A<=>~A
Entailment
A knowledge base KB entails a sentence S if the models satisfying KB are a subset of the models satisfying S
Models can satisfy S without satisfying KB, but not the reverse
Logically Equivalent
Two sentences are logically equivalent iff they entail each other
i.e. the set of models satisfying one is the same as the set of models satisfying the other
Soundness
A system is sound if it can only generate logical consequences
KB|-A => KB|=A
i.e. all things inferred are entailed
Completeness
A system is complete if it can generate all logical consequences
i.e. KB|=A => KB|-A
i.e. all things entailed can be inferred
Inference Systems
Chaining
Horn Form
Premise=>Conclusion
Premise = a conjunction of propositions
Conclusion = a proposition
Or a stand-alone proposition (a fact)
Forward Chaining
Algorithm
Find implication rules that have all their premises in the KB,
but not their conclusions
Add the conclusion to the KB
Repeat until the RTP is proven
Slower than backwards chaining, as it proves a lot of other things along the way
Backwards chaining is more directed
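A minimal sketch of propositional forward chaining over Horn clauses. Rules are assumed to be (premises, conclusion) pairs, facts a set of known propositions, and query the RTP.

```python
def forward_chain(rules, facts, query):
    facts = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            # fire any rule whose premises are all known but whose conclusion is not
            if conclusion not in facts and all(p in facts for p in premises):
                facts.add(conclusion)
                changed = True
                if conclusion == query:       # stop as soon as the RTP is proven
                    return True
    return query in facts

# e.g. forward_chain([(("A", "B"), "C"), (("C",), "D")], {"A", "B"}, "D") -> True
```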
Backwards chaining
Algorithm
Check whether the RTP is in the KB. If so, return true; if not:
Find all implications that have the RTP as their conclusion
If, for any of those implications, you can prove all the premises (with backwards chaining), then the RTP is true
If not, then the RTP is false
Not complete unless you avoid enqueueing goals to be proven that are already pending (otherwise it can loop forever)
Can be made faster by caching truths and falsehoods
Resolution
Conjunctive normal form
The knowledge base as a whole is a conjunction of disjunctions
A conjunction of clauses
An AND of ORs
Clauses are disjunctions (ORs) of literals
Literals are positive or negated proposition symbols
Resolution inference
OR together two clauses containing complementary literals; the result is the OR of the remaining literals, with the complementary pair removed
Satisfiability Problem
Useful for finding out whether there are models (and returning one) for a KB
Especially if you add to the KB something you want to know
Or its negation (for proof by contradiction)
NP-Complete
DPLL (a variation of Davis-Putnam; see the sketch after these steps)
Depth-first search to find a model that satisfies a sentence
Terminate with false if any clause in the sentence is false
i.e. if any clause has all its literals false
Terminate with true if all clauses are true
Remove pure symbols
Symbols that don't have their complement in any clause
They can't affect the result
Add them, with their required value, to the model
Recursively try to find a model for the remaining sentence
Handle unit clauses
A clause that is one literal long
Since the sentence is only true if all clauses are true,
this clause determines the value of its symbol
Add that value to the model
Recursively try to find a model for the remaining sentence
Otherwise, choose a symbol
Consider (with DPLL) the cases of that symbol being true and of it being false
Return the OR of these
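A minimal DPLL sketch following the steps above. The representation is an assumption: a sentence is a list of clauses, each clause a set of (symbol, polarity) literals, so ("A", False) stands for ~A; symbols lists every proposition symbol in the sentence. The function returns a satisfying model or None.

```python
def dpll(clauses, symbols, model):
    unknown = []
    for clause in clauses:
        values = [model.get(s) if pol else
                  (None if model.get(s) is None else not model[s])
                  for s, pol in clause]
        if any(v is True for v in values):
            continue                         # clause already true
        if all(v is False for v in values):
            return None                      # a clause is false: fail this branch
        unknown.append(clause)
    if not unknown:
        return model                         # all clauses true: model found
    # pure symbols: appear with only one polarity in the remaining clauses
    polarities = {}
    for clause in unknown:
        for s, pol in clause:
            if s not in model:
                polarities.setdefault(s, set()).add(pol)
    for s, pols in polarities.items():
        if len(pols) == 1:
            return dpll(clauses, symbols, {**model, s: pols.pop()})
    # unit clauses: only one unassigned literal left, so its value is forced
    for clause in unknown:
        unassigned = [(s, pol) for s, pol in clause if s not in model]
        if len(unassigned) == 1:
            s, pol = unassigned[0]
            return dpll(clauses, symbols, {**model, s: pol})
    # otherwise branch on some unassigned symbol and OR the two cases
    s = next(sym for sym in symbols if sym not in model)
    return (dpll(clauses, symbols, {**model, s: True}) or
            dpll(clauses, symbols, {**model, s: False}))

# e.g. dpll([{("A", True), ("B", True)}, {("A", False)}], ["A", "B"], {})
#      -> {'B': True, 'A': False}
```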
WalkSAT
Performs a semi-random walk to locally search for a model that satisfies the sentence
Parameters
Max flips: the maximum number of iterations to try
P: normally about 0.5
Initially the model is a random assignment of true/false to all the symbols in the sentence
Algorithm: repeatedly
If the model satisfies the sentence, then done; else
Randomly pick a false clause C
With probability P: flip a random symbol from C in the model
Otherwise (probability 1-P): flip the symbol in C that maximises the overall number of clauses that are true
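A minimal WalkSAT sketch using the same clause representation as the DPLL sketch above; p and max_flips are the two parameters from the notes.

```python
import random

def walksat(clauses, symbols, p=0.5, max_flips=10000):
    # initially the model is a random true/false assignment to every symbol
    model = {s: random.choice([True, False]) for s in symbols}
    def satisfied(clause):
        return any(model[s] == pol for s, pol in clause)
    for _ in range(max_flips):
        false_clauses = [c for c in clauses if not satisfied(c)]
        if not false_clauses:               # model satisfies the sentence: done
            return model
        clause = random.choice(false_clauses)
        if random.random() < p:
            # flip a random symbol from the chosen false clause
            sym = random.choice([s for s, _ in clause])
        else:
            # flip the symbol in C that maximises the number of satisfied clauses
            def score(s):
                model[s] = not model[s]
                n = sum(1 for c in clauses if satisfied(c))
                model[s] = not model[s]
                return n
            sym = max({s for s, _ in clause}, key=score)
        model[sym] = not model[sym]
    return None                             # no model found within max_flips
```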
9. First Order Logic
Situation Calculus
Can formulate planning as inference on a KB
Situation Function: Result(a,s)
a is an action
s is the situation
prior to a being taken
Effect Axiom
Pre(s) -> Post(Result(action,s))
Post (and Pre) are both predicates on the situation
Problems
Ramification Problem
Can't know all postconditions
Probabilistic postconditions (only happen some of the time)
Qualification Problem
Can't know all preconditions
Frame Problem
How to deal with things that don't change
Inference: default things to not changing
Yale Shooting Problem (not examinable)
Representation
Successor State Axiom
P is true in the resulting state iff
the action just taken made it true, or
P was true in the previous state and that action did not make it untrue
P(result(a,s)) <=> MadeTrueP(a,s) v (P(s) ^ DidNotUndoP(a,s))
General
Unification
For a set of substitutions S
Unify(a,b)=S if aS=bS
Standardising Apart
Renames variables that have the same name but a different meaning
Gives new variable names, irrespective of scope
Skolemisation
Method for removing existentially quantified variables
Replace the variable with a function of the enclosing universally quantified variables
Called a Skolem function
Required for resolution
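A minimal unification sketch for Unify(a,b) = S with aS = bS. The term representation is an assumption: variables are strings starting with "?", constants are other strings, and compound terms are tuples such as ("Knows", "?x", "John"); the occurs check is omitted.

```python
def is_var(t):
    return isinstance(t, str) and t.startswith("?")

def unify(a, b, subst=None):
    """Return a substitution dict S such that aS = bS, or None if none exists."""
    if subst is None:
        subst = {}
    if a == b:
        return subst
    if is_var(a):
        return unify_var(a, b, subst)
    if is_var(b):
        return unify_var(b, a, subst)
    if isinstance(a, tuple) and isinstance(b, tuple) and len(a) == len(b):
        for x, y in zip(a, b):            # unify argument lists element by element
            subst = unify(x, y, subst)
            if subst is None:
                return None
        return subst
    return None                           # mismatched constants or structure

def unify_var(var, term, subst):
    if var in subst:
        return unify(subst[var], term, subst)
    if is_var(term) and term in subst:
        return unify(var, subst[term], subst)
    return {**subst, var: term}           # bind the variable (no occurs check)

# e.g. unify(("Knows", "?x", "John"), ("Knows", "Mary", "?y"))
#      -> {'?x': 'Mary', '?y': 'John'}
```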
Inference Systems
Chaining
Generalised Modus Ponens (GMP)
For KBs
of Definite clauses
exactly 1 positive literal per clause
All variables universally quantified
p1', p2', ..., pn', (p1 ^ p2 ^ ... ^ pn => q) |- qS
where pi'S = piS for all i
So pi' and pi are the same predicate, made identical by the substitution S