Generative AI & Artificial General Intelligence (AGI)

Scraps from various sources and my own writings on Generative AI, AGI, Digital, Disruption, Agile, Scrum, Kanban, Scaled Agile, XP, TDD, FDD, DevOps, Design Thinking, etc.

Navigate

Home
Curriculum Vitae
Artificial Intelligence
Personal Blog
Experience
PO Interview Questions

Page Hits

Sunday, October 05, 2025

Instruction Tuning and Reinforcement Learning from Human Feedback (RLHF)

RLHF - rewards to generate higher reinforcements

at October 05, 2025

Email This BlogThis!Share to X Share to Facebook Share to Pinterest

No comments:

Post a Comment

Newer Post Older Post Home

Subscribe to: Post Comments (Atom)

If we already have automation, what's the need for Agents?

“Automation” and “agent” sound similar — but they solve very different classes of problems. Automation = Fixed Instruction → Fixed Outcome ...

V Model - Classical Lifecycle

Requirements Analysis -- Business requirements document or business requirements specification System Design -- Systems requireme...
2001 Snowbird Resort Utah Original Pictures

Search This Blog

About Me

Vijay Sudhakar: Australia

View my complete profile

Blog Archive

▼ 2025 (37)
- ▼ October 2025 (21)
- ► September 2025 (2)
- ► August 2025 (12)
- ► March 2025 (1)
- ► February 2025 (1)

► 2024 (40)
- ► June 2024 (5)
- ► May 2024 (14)
- ► April 2024 (18)
- ► March 2024 (3)

► 2023 (13)
- ► December 2023 (1)
- ► November 2023 (2)
- ► July 2023 (1)
- ► May 2023 (2)
- ► April 2023 (7)

► 2022 (1)
- ► July 2022 (1)

► 2021 (23)
- ► November 2021 (2)
- ► October 2021 (6)
- ► September 2021 (7)
- ► July 2021 (6)
- ► June 2021 (1)
- ► May 2021 (1)

► 2020 (92)
- ► November 2020 (2)
- ► September 2020 (1)
- ► August 2020 (10)
- ► July 2020 (51)
- ► June 2020 (18)
- ► May 2020 (3)
- ► April 2020 (1)
- ► March 2020 (2)
- ► February 2020 (4)

► 2019 (174)
- ► November 2019 (41)
- ► October 2019 (4)
- ► September 2019 (3)
- ► July 2019 (16)
- ► June 2019 (13)
- ► May 2019 (23)
- ► April 2019 (27)
- ► March 2019 (7)
- ► February 2019 (12)
- ► January 2019 (28)

► 2018 (90)
- ► December 2018 (26)
- ► November 2018 (12)
- ► October 2018 (13)
- ► September 2018 (8)
- ► August 2018 (18)
- ► July 2018 (5)
- ► May 2018 (1)
- ► April 2018 (3)
- ► February 2018 (3)
- ► January 2018 (1)

► 2017 (36)
- ► November 2017 (9)
- ► October 2017 (9)
- ► May 2017 (1)
- ► April 2017 (13)
- ► March 2017 (3)
- ► February 2017 (1)

► 2016 (2)
- ► March 2016 (1)
- ► February 2016 (1)

► 2015 (10)
- ► November 2015 (1)
- ► June 2015 (3)
- ► May 2015 (2)
- ► February 2015 (4)

► 2014 (11)
- ► August 2014 (1)
- ► July 2014 (7)
- ► May 2014 (1)
- ► April 2014 (2)

► 2013 (8)
- ► November 2013 (3)
- ► October 2013 (4)
- ► August 2013 (1)

► 2012 (7)
- ► November 2012 (5)
- ► September 2012 (2)

► 2011 (1)
- ► July 2011 (1)

► 2010 (19)
- ► November 2010 (1)
- ► August 2010 (1)
- ► March 2010 (2)
- ► February 2010 (14)
- ► January 2010 (1)

► 2009 (13)
- ► December 2009 (1)
- ► October 2009 (3)
- ► August 2009 (1)
- ► June 2009 (3)
- ► May 2009 (1)
- ► March 2009 (1)
- ► February 2009 (3)

► 2008 (13)
- ► September 2008 (1)
- ► August 2008 (9)
- ► February 2008 (3)

► 2007 (62)
- ► December 2007 (5)
- ► November 2007 (2)
- ► September 2007 (1)
- ► February 2007 (36)
- ► January 2007 (18)

► 2006 (1)
- ► November 2006 (1)

Report Abuse

bluespaceglobal.com.au

Labels

1-tier architecture (1) 2-tier architecture (1) 2018 (1) 3-tier architecture (1) 3rd Wave of Agile (1) 5 Trademarks of Agile Organization (1) 5 Why's (1) 5 Whys (1) 5S (1) 7 Principles of Software Testing (1) 8 D Problem-solving Technique (1) 8-wastes (1) 90% syndrome (1) Against Scrum; Scrum Disadvantages (1) AGI (1) Agile (35) Agile and Fractals (1) Agile antipatterns (1) Agile Budgeting (1) Agile Bugs (1) Agile cereal box (1) Agile Challenges (1) Agile Change (4) Agile Change management (4) Agile Coach (1) Agile Coaching (1) Agile Conflict Resolution (1) Agile Delivery (2) Agile Disruption (1) Agile ebooks (1) agile estimation (2) Agile Funding (1) agile journey (1) agile leadership (3) Agile Manifesto (1) Agile Metrics (1) Agile Mindset (1) Agile Model (1) agile organization (1) agile organizations (1) Agile Personality Traits (1) Agile Principles (1) agile project management (2) Agile Quality (1) Agile SAFe (1) Agile success factors (1) Agile Tenets (1) Agile testing (1) Agile Tools (1) Agile transformation (6) Agile Values (1) Agile with big A (1) Agile with small a (1) Agile WoW Risks (1) agility (4) Agility in IT Operating Model (1) AI (1) AIPodcast (1) Airbnb workflow (1) Ambiguity (1) American Retailer (1) AML (1) anti patterns (1) antipatterns (1) API (7) API Economy (1) API Motto (1) Application Architecture (2) Application Scaling (1) Approach to agile teams (1) Approach to Teams (1) Artificial Intelligence (7) Audit process (2) AUSTRAC (1) Authentication (1) Authorisation (1) Authorization (1) Automation (1) Autonomation (1) Azure (3) BA (1) BAAS (1) Banking Regulations (1) Basel (1) Basel 1 (1) Basel 123 (2) Basel 2 (1) Basel 3 (1) Basel accord (1) Basel I (1) Basel I II III (1) Basel II (1) Basel III (1) Basel Norms (1) Basic insurance claim lifecycle (1) BDD (1) Behavior Driven Development (1) Behavioural questions (1) Being Agile (1) Bell Curve (1) Benefits of Scrum (1) Bi-directional traceability (1) Big Data (6) Big Upfront Design (1) Bloom's Taxonomy (1) Books (1) Bradley Bug Chart (1) Branching and merging techniques (2) Budgeting (1) BUFD (1) bug life cycle (2) Bug Priority (1) Bug report (1) Bug Severity (1) Bug Tracking (1) Burnout (1) Business (1) Business Analysis (1) Business goal (1) Capabilities (1) Capex (1) Cargo Cult (1) CAS (1) Case (1) Case Type (1) CBPM (1) Central Administration (1) cereal box (1) chaku chaku (1) Chaku-chaku (1) Change (2) change management (4) Changing Mindset (1) ChatGPT (1) chip level programming (1) Choose www or no www (1) choosing candidates for agile transformation (1) christopher avery (1) claims (1) Cloud Computing (4) Cloud computing. Public cloud (1) Cloud Security (1) CM Audits (2) CMMI and ISO (1) CMMI High Maturity (1) CMMI V1.3 (1) CMMI V1.3 CONSTELLATIONS (1) CMMI V1.3 Webcast Notes (1) Coaching (1) Coaching vs. Consulting (1) CobiT (1) code maintainability (1) Code refactoring (1) code testability (1) Cognitive Bias (1) Commitment based management (1) Complex Adaptive Systems (1) Complexity (2) Complexity Model (1) Confidence (1) Configuration Management (2) Conflict Resolution (1) Consulting (1) container (1) Containerisation (2) Continuous Improvement (1) Contract (1) contracts (1) COQ (1) COSO - COBIT Relationship / Mapping (1) Cost of Ownership (1) Cost of Quality (1) Critical Review (1) Critical Review of Agile (1) CRM (1) CTF (1) culture (1) Culture and Value (1) Customer Journey Map (2) cyber security (2) Cybersecurity (2) Cyclomatic complexity (1) Data Analysis (1) Data Analytics (1) Data Governance (1) Data Loss Prefention (1) Data Observability (1) Data Science (1) Data Science Projects (1) Data Security Posture Management (1) Data-driven Design (2) dataset (1) DEEP (1) Defect (1) Defect Priority (1) Defect Removal Efficiency (1) Defect resolution in TxS (1) Defect Severity (1) Defect Triage (1) Definition of Done (2) Delay tasks (1) delegating (1) Deming (1) Design Process (1) Design Sprint (2) Design Thinking (21) Deterministic System (1) Developer (1) DevOps (4) DevSecOps (1) DevSecOps Technology (1) Digital (1) dimensions of trust (1) Disruption (2) Disruptive Innovation (1) disruptive technology (2) DLP (1) Do it Yourself Sprint (1) Docker (1) DoD (2) Doing Agile (1) DoR (1) Double Diamond (1) Double Donut (1) DRE (1) Drefys Model (1) Dreyfus (1) DSPM (1) Dual Track Agile (1) Dual-track Agile (1) DVF (2) e-books (1) eBooks (1) elaboration (1) elaboration sessions (1) Elicit and Analyse Requirements (1) embedded systems (1) embedded systems project management (1) Emerging Technologies (1) Empathy map (1) empowerment (1) End to End Design Process (1) Epic breakdown (1) Epics (2) Eric Ries (1) estimation failure (1) Evolutionary Delivery (1) Expand Work (1) Experience (1) FAAS (1) feature branches (1) feature buckets (1) Features (1) financial analysis (1) flow efficiency (1) Fractals (1) Function points (1) Funding (1) Gap analysis (1) Gartner (2) Gemba (1) Gestalt Principles (1) git (1) Git commands (1) Github (1) Google (1) Google AI (1) Graph Theory (1) Green belt and black belt projects (1) Gregory McDonald (1) habit (1) Hadoop (1) Hadoop Ecosystem (1) Handling Bugs in Agile (1) Hanedashi (1) Hansei (1) HCD (1) Horizontal Scaling (1) Hoshin Kanril Jidoka (1) hot fix (1) Hotel Reservation System (1) HP QC Administration (1) http (1) http/2 (1) Human Centered Design (1) Human Centred Design (1) hybrid cloud (1) Hype Cycle (1) hyper text (1) IAAS (1) IAG Agility (1) IAG Experience (3) IBM Rational Jazz... (1) Ideation (1) Ideation Workshop (2) IEEE (1) Impact Mapping (1) Impediments (1) Implementing Configuration Management for Software Testing Projects (1) Inception (1) Incremental (1) Incremental Delivery (1) Industry Testing (1) Infinite Game (1) Influencing agility (1) Information Technology Infrastructure for Service Management (1) Innovation (1) Innovator's Dilemma (2) Inspect & Adapt (1) Inspect and Adapt (1) Insurance (2) insurance claim lifecycle (2) Integration Testing (2) Intentional Programming (1) interview questions (2) Iron Triangle (1) Ishikawa Diagrams (1) ISO 12207 (1) Issues (1) IT Operating Model (1) Iteration (1) Iterative (1) ITSM (1) Jack Reeves (1) Japanese Lean Terminology (1) JavaScript (1) JavaScript Object Hierarchy (1) Jeijunka (1) JIT (1) JS Object Hierarchy (1) Jurgen Appelo (1) Just-in-time (1) Kaizen (1) Kanban (4) kano analysis (2) Kano Model (2) Key agile topics (1) key factors (1) Kubernetes (1) kubler-ross model (1) Large Language Model (3) lead (1) leadership (3) Lean (17) Lean books (1) Lean Coffee (1) Lean ebooks (1) Lean Enterprise (1) Lean pioneers (1) Lean six sigma (1) Lean Startup (4) Lean value stream mapping (1) Learning (1) Learning Model (1) Learning Models (1) Lehman's laws of software evolution (1) LLM (5) LLM Architecture (1) LMS (1) Lockheed Martin (1) M and A (1) Machine Learning (4) Management 3.0 (1) master branch (1) McKinsey (2) medical insurance (1) medical insurance claim (1) medical insurance workflow (1) Metrics for support projects (1) Microservices (1) Microservices vs. API (1) Microsoft Azure (2) Mike Coon (1) Mindmap (1) Mindset (2) Mindsets (1) Mizusumashi (1) Model (1) Mortgage Process (1) moscow (2) Moscow prioritisation (1) Muda (2) Muda Nagara (1) Mulesoft (1) Mulesoft ESB (1) Mura (2) Muri (2) MVP (3) Mythical man month (1) n-tier architecture (1) Narrative Paradigm (1) Natural Language Processing (1) NLP (1) NVIDIA (1) Objectives and Key Results (1) OKR (1) OLA (1) Opex (1) organisational agility (1) Organisational Change (1) organisational culture (1) Organization as living system (1) organizational agility (1) organizational change (2) organizational culture (1) organizational learning (1) organizations (1) Origins of design thinking (1) Overcoming Agile Challenges (1) PAAS (2) Parameters (1) Parkinson's Law (1) Paypal (1) Paypal Integration (1) Paypal Integration Testing (1) PCMM (1) PDCA (1) Pega (1) Penetration Testing (1) Persona (2) Personality Traits (1) Personas (1) PI Planning (1) planning (1) PO (1) POC (1) Poka Yoke (1) Poka-yoke (1) portfolio management (1) PPB PCB PROCESS PERFORMANCE BASELINE PROCESS CAPABITLITY BASELINE (1) PPQA (1) prasanna kumar cmmi sample questions (1) Pre-planning (1) Predictability (1) prioritization techniques (2) Priority matrix (1) private cloud (1) procedure (1) Process and product quality assurance (1) Process Area Categories (1) Process Assets Process Database (1) Process Capability Baseline (1) Process implementation (1) Process institutionalization (1) Processes (2) Procrastinate (1) Product (1) Product Development (2) product goal (1) Product Management (2) product owner (1) Product Owner Maturity (1) product prioritization (1) Product Trio (1) program management (1) Project Estimations during proposal phase (1) Project indicators (1) project management (1) Prompt Engineering (1) Prototype (1) Pull (1) Pull System (1) Quality (1) Quality Assurance (1) Quality Control (1) Quality in Agile (1) Quality Practices (1) Rancher (1) Rapid Delivery (1) RCA (1) RDD (1) refactor (1) refactoring (1) Regression Testing (1) relearning (1) release branching (1) Remote Desktop Services (1) Requirement Prioritization (1) Resistance to change (1) Resume Driven Development (1) retro (1) Retrospective (2) Retrospective Tools (1) Ringelmann effect (1) Risk Management (1) Risks and Issues (1) RM and RD (1) Role (1) Role of an Agile Coach (1) Roles & Responsibilities (1) Roles & Responsibilities of Agile Coach (1) Ron Jeffries (1) Root Cause Analysis (1) roundabouts (1) RSKM (1) Run charts and control charts (1) S/w program vs. industrial product (1) SAAS (2) SAFe (1) Safe Agile (1) SAFe Requirements Model (1) Salesforce (2) samle user stories (1) Sample epics (1) sample features (1) Sample Sprint report (1) Sample user story (1) Sanity Testing (1) Satir Change Model (1) Savioke (1) SBD (1) SCAMPI - A (1) Schema Repository (1) Scrum (10) Scrum anti patterns (1) Scrum Benefits (1) Scrum Blockers (1) Scrum Guide (1) Scrum Guide 2021 (1) Scrum Impediments (1) scrum master (2) Scrum Master Responsibilities (1) Scrum Master Roles & Responsibilities (1) scrum retrospectives (1) Scrum Tip (1) Scrumban (1) Sears Roebuck (1) SEBOK (1) Secure by Design (1) security (1) self-confidence (1) Selling Agile (1) servant leadership (2) Serverless (2) Shamrock Map (1) Shamrock Maps (1) Shift Left (1) Shojinka (1) Shu Ha Ri (1) Shuhari (1) Simon Wardley (1) Six Sigma (1) Skunk Works (1) Skunk Works Jet (1) SLA (1) SLM (2) Slow Thinking (1) Slow thinking manifesto (1) SM 360 Degree View (1) Small Language Model (1) Small LLM (1) Smoke Test (1) Snowbird Ski Resort (1) Social loafing (1) Software Design (1) Software Development (1) Software Quality (1) software testing (3) SOLID (1) Solid Principles (1) Soundcloud (1) Sources of Variation (1) Speed (1) Sprint (1) sprint goal (1) sprint planning (1) sprint report (1) SQA Interview Questions (1) SQL (3) Stack (1) Starting agile journey (1) Statistical available tests (1) Statistical tools - when to use what (1) steps for agile transformation (1) STLC (1) Stories (1) Story Mapping (3) Storyboarding (2) strategy (1) Strucgtured Query Language (1) Structured Query Language (2) Student Syndrome (1) success factors (1) Sustaining technology (1) sustaining vs disruptive technologies (1) System Testing (1) Systems Engineering (1) Systems Engineering Body of Knowledge (1) TDD (2) TDD Example (1) Technical Practice (1) Technology Stack (1) Telstra experience (1) Test Case after Coding (1) Test Driven Development (1) Test Planning & Execution (1) Test Process Imporvement (1) Test scripts (1) Tester (1) Testing after Code (1) Testing Defects (1) TFS (1) The third variable problem (1) Theory X and Theory Y (1) Three Amigos (1) TMMI (1) tokenization (1) tokens (1) Tool Administration (1) tools (1) Toyota (1) Toyota Personalised Rates (1) Toyota Production System (1) Toyota Way (1) TPI (1) TPR (1) TPS (2) Trademark (1) Traffic lights (1) Transformation (1) Trust (1) TxS (1) Unit Testing (1) Unpredictability (1) URI (1) URL (1) URN (1) Usability testing (1) User Acceptance Testing (1) User Experience (1) User Stories (1) user story (1) User Story Mapping (1) Utah (1) UX (1) UX Design (2) UX Sketch (1) V Model (1) Value (1) Value Proposition (1) Value Stream (2) Value Stream Map (1) Value stream mapping (4) Values & Principles (1) Verification (1) Verification and validation (1) Vertical Scaling (1) Virginia Satir (1) Virtual Machine (1) VM (1) Volatality (1) VSM (5) VUCA (1) Wardley Maps (1) Waterfall (1) Webservices (2) WinIT (1) WIP (1) Work in Progress (1) work instruction (1) Workflow (1) XML (2) XML tags (1) XP-80 Shooting Star (1) XSL (1) XSLT (1) YAGNI You Ain't Gonna Need It (1) Yokoten (1) ZBB (1) Zero Based Budget (1) Zero Based Budgeting (1)

If not "courtesided", rights belong to me. If i have forgotten to quote right source, pls contact.. Travel theme. Powered by Blogger.

Contact Form

Name

Email *

Message *